Nucleic acid encoding amyloid precursor-like protein and uses thereof

ABSTRACT

The present invention is directed to isolated amyloid precursor-like proteins (APLPs), nucleotide sequences coding for and regulating expression of these protein, antibodies directed against these proteins, and recombinant vectors and host cells containing the genetic sequences coding for and regulating the expression of these protein sequences. The invention is also directed to isolated genomic DNA, cDNA anti-sense RNA, and RNA containing the protein sequence. Antibodies can be used to detect an APLP in biological specimens, including, for example, fluid, serum or tissue samples. APLP1 and APLP2 are candidate genes for late onset familial Alzheimer&#39;s disease.

The present invention was made utilizing funds of the United States Government. The United States Government is hereby granted a worldwide royalty fee, paid up, non-exclusive license in the invention.

This application is a continuation-in-part application of U.S. application Ser. No. 07/930,022 filed Aug. 17, 1992, now abandoned, which is a continuation-in-part application of U.S. application Ser. No. 07/872,642 filed Apr. 20, 1992, now abandoned, which disclosures are herein incorporated in their entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to amyloid precursor-like proteins (APLPs) including APLP1 and APLP2; products and processes involved in the cloning, preparation, and expression of genes for APLPs; antibodies with specificity to these proteins and to diagnostic and therapeutic uses thereof.

2. Brief Description of the Background Art

Alzheimer's disease (AD) is a neurodegenerative disorder characterized by the presence of neurofibrillary tangles, and amyloid in the form of senile plaques and blood vessel deposits in the brain. The main component of amyloid is Aβ, a 39-43 amino acid peptide derived from a larger amyloid precursor protein (APP) encoded by a gene on chromosome 21 (Glenner and Wong, Biochem. Biophys. Res. Commun. 120:885-890 (1984); Goldgaber et al., Science 235:877-880 (1987); Kang et al., Nature 325:733-736 (1987); Robakis et al., Proc. Natl. Acad. Sci. USA 84:4190-4194 (1987); Tanzi et al., Science 235:880-884 (1987)). A proportion of AD is inherited (reviewed in St. George-Hyslop et al., Neurobiol. Aging 10:417-425 (1989)) and the familial form (FAD) has been clearly shown to be a genetically heterogeneous disorder (Schellenberg et al., Science 241:1507-1510 (1988); St. Geroge-Hyslop et al., Nature 347:194-197 (1990)). An early-onset (<65 years of age) FAD locus was mapped to chromosome 21 (St. Geroge-Hyslop et al., Science 235:885-889 (1987)), and a small percentage (less than 3%) of FAD is linked to point mutations within the APP gene (Goate et al., Nature 349:704-706 (1991); Murrell et al., Science 254:97-99 (1991); Chartier-Harlin et al., Nature 353:884-846 (1992); Tanzi and Hyman, Ann. N.Y. Acad. Sci 640:149-154 (1992); Mullan et al., Nature Genetics 2:340-342 (1992)). More recently, studies revealed the presence of a major early-onset FAD locus on chromosome 14 (Mullan et al., Nature Genetics 2:340-342 (1992); Schellenberg et al., Science 258:668-673 (1992); St. George-Hyslop et al., Nature Genetics: in press (1992); Broeckhoven et al., Nature Genetics 2:335-339 (1992)). Some early-onset FAD pedigrees do not demonstrate linkage to either the APP gene or chromosome 14 (Schellenberg et al., Science 241:1507-1510 (1988); St. Geroge-Hyslop et al., Nature 347:194-197 (1990); Schellenberg et al., Science 258:668-673 (1992); Tanzi, R. E., Am. J. Human Genet. 51:273-282 (1992)).

A subset of, but not all late-onset (>65 years of age) FAD pedigrees appear to be linked to a locus on chromosome 19 (Pericak-Vance et al., Am. J. Hum. Genet. 48, 1034-1050 (1991)). Therefore, in addition to genes on chromosomes 14, 19 and 21 there would appear to be at least two additional FAD loci that remain unmapped. Candidate genes for other FAD loci include potential APP gene homologues which might either serve as alternative substrates for amyloid formation, or interfere with normal processing of APP.

APP resembles an integral membrane protein with a small cytoplasmic C-terminal domain and a larger, extracellular N-terminal domain (Kang et al., Nature 325:733-736 (1987)). The APP gene produces at least six transcripts (Ponte et al., Nature 331:525-527 (1988); Tanzi et al., Nature 331:528-530 (1988); Kitaguchi et al., Nature 331:530-532; De Sauvage et al., Science 245:651-655 (1989); Jacobsen et al., Neurobiol. Aging 12:575-583 (1991); Konig et al., J. Biol. chem. 267:19804-19809 (1992)), five of which include an alternatively spliced exon encoding a Kunitz type protease inhibitor domain. APP appears to be processed through at least two pathways. A secretory pathway results in the cleavage of APP at a site near the membrane of extracellular-junction, within the βA4 domain (Esch et al., Science 248:1122-1124 (1990); Sisodia et al., Science 248:492-495 (1990)), and precludes the formation of Aβ. Alternatively, processing in the endosomal-lysosomal pathway leads to the production of carboxy terminal derivatives that contain the Aβ peptide (Estus et al., Science 255:726-728 (1991); Golde et al., Science 255:728-730 (1991); Haass et al., Science 357:500-503 (1992)). More recently, it has been demonstrated that Aβ is generated in a soluble form by cultured cells (Haass et al., Nature 359:322-325 (1992); Sebubert et al., Nature 359:325-327 (1992); Shoji et al., Science 258:126-129 (1992)), although the exact pathway leading to the generation of soluble Aβ is unknown. Presumably, a precise balance among these pathways must be maintained in order to avoid the overproduction of amyloidogenic fragments.

It has previously been hypothesized that proteins resembling APP in amino acid sequence and overall structure may interact with, and effectively compete for factors playing a role in the maturation and/or metabolism of APP (Wasco et al., Genomics: in press (1992); Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992)). Recent evidence indicates that APP is a member of a family of proteins with specific domains that have been remarkably well conserved.

cDNA clones predict that APP is a transmembrane protein with a small intracellular C-terminal domain and a larger, extracellular N-terminal domain (Goldgaber et al., Science 235:877-880 (1987); Kang et al., Nature 325:733-736 (1987); Robakis et al., Proc. Natl. Acad. Sci. USA 84:4190-4194 (1987); and Tanzi et al., Science 235:880-884 (1987)). There are at least four forms of APP that result in proteins of 563, 695, 751 and 770 amino acids (De Sauvage et al., Science 245:651-653 (1989); Kitaguchi et al., Nature 331:530-532 (1988); Ponte et al., Nature 331:525-527 (1988); and Tanzi et al., Nature 331:528-530 (1988)). At least three of these are the result of alternative splicing (Kitaguchi et al., Nature 331:530-532 (1988); Ponte et al., Nature 331:525-527 (1988); and Tanzi et al., Nature 331:528-530 (1988)). The two larger forms of APP contain a 56 amino acid insert that has homology to the KuNa family of protease inhibitors (Kitaguchi et al., Nature 331:530-532 (1988); Ponte et al., Nature 331:525-527 (1988); and Tanzi et al., Nature 331:528-530 (1988)).

One of the secreted forms of APP that contains a protease inhibitor domain has been found to be identical to protease nexin-II (Oltersdorf et al., J. Biol. Chem. 265:4492-4497 (1990); and Saitoh et al., Cell 58:615-622 (1989)).

The βA4 domain spans the predicted membrane-extracellular junction of APP. How this peptide is liberated from the precursor protein in Alzheimer's disease remains unclear. Results from a number of laboratories suggest that normally the precursor protein is secreted via cleavage at sites that are at or near the membrane-extracellular junction, apparently within the βA4-domain (Esch et al., Science 248:1122-1124 (1990); Selkoe et al., Proc. Natl. Acad. Sci. USA 85:7341-7345 (1988); Weidman et al., Cell 57:115-126 (1989); and Sisoda et al., Science 248:492-495 (1990)). If these findings are correct, normal processing of the secreted form of APP would preclude the formation of βA4, and therefore, of amyloid itself. Such a model implies that βA4 contained within the plaques and cerebrovascular deposits is the result of an abnormal processing event.

SUMMARY OF THE INVENTION

Presently, human and mouse cDNA clones encoding an amyloid precursor-like protein APLP1 on human chromosome 19 that is similar to APP in overall structure and amino acid sequence, have been isolated (Wasco et al., Genomics: in press (1992); Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992)).

Specifically, a mouse brain cDNA that encodes a protein whose predicted amino acid sequence is 42% identical and 64% similar to APP, has presently been identified. This amyloid precursor-like protein (APLP1) is clearly not encoded by the mouse homologue of human APP which homologue has been cloned previously and found to be 96.8% identical to human APP (Yamada et al., Biochem. Biophys. Res. Comm. 158:9060912 (1987)). An APP-like gene has also been isolated from Drosophila (APPL; Rosen et al., Proc. Natl. Acad. Sci. USA 86:2478-2482 (1989)) and another has been reported in the form of a partial cDNA clone isolated from a rate testes library (Yan et al., Proc. Natl. Acad. Sci. USA 87:2405-2408 (1990)). Interestingly, Luo et al. (Luo et al., Neuron 9:595-605 (1992)) have demonstrated a functional homology between APPL and APP. Transgenes expressing human APP or Drosophila APPL both show a similar level of rescue of Appl- fast phototaxis defect Drosophila mutants. These data would indicate that APP is a member of an evolutionarily highly conserved family of proteins that may share common functions and interact with similar proteins.

The present APLP1 shares three distinct regions of homology with APP. It is encoded by a 2.4 kb message that is present in mouse brain and neuroblastoma cells. Antibodies raised against a synthetic peptide derived from the C-terminus of APLP1 stain the Golgi in neuroblastoma cells.

In an attempt to isolate other members of the APP protein family, the mouse APLP1 sequence was first used to scan the Genbank database for homologous sequences. In addition to obtaining matches for APP, APPL, and the partial cDNA from rate testes, a match with an anonymous 274 base pair human brain cDNA entry (Genbank accession number M78104), was noted. This match, which was significant but not identical to mouse APLP1 (63% identity), indicated that M78104 was a small piece of a cDNA encoding a second APLP. In order to characterize the APP-like gene family in more detail, full length cDNAs for this second APLP, APLP2, were isolated. The isolation and characterization of APLP2 cDNA clones from human brain, provide further support for the hypothesis that APP is a member of a highly conserved gene family.

The cDNA described here, together with APP and two other genes--one from Drosophila (Rosen et al., Proc. Natl. Acad. Sci. USA 86:2478-2482 (1989)), and one from rat (Yan et al., Proc. Natl. Acad. Sci. USA 87:2405-2408 (1990))--encode a family of proteins with conserved domains of homology which may share common functions. The present APLP is a membrane associated protein which is localized to the golgi.

The present invention relates to amyloid precursor-like proteins (APLPs), products and processes involved in the cloning, preparation and expression of genes for APLPs; antibodies with specificity to these proteins; and nucleotide probes corresponding to an APLP or portions of an APLP polypeptide. An APLP polypeptide is useful for producing antibodies thereto. The antibodies and probes are useful for detecting the presence of an APLP in biological specimens, including, for example, body fluid, serum or tissue samples from an animal or human.

The present invention relates to the identification, characterization and sequencing of cDNAs and genomic fragments which encode APLPs that are present in mammalian cells including brain cells and neuroblastoma cells.

According to the invention, there are provided genetic sequences encoding APLPs. The invention also provides for expression vectors containing such genetic sequences, hosts transformed with such expression vectors, and methods for producing the genetically engineered or recombinant APLPs.

The present invention also provides antibodies which specifically recognize an APLP.

The present APLP cDNA and recombinant protein are useful for making antibodies which specifically recognize an APLP. Such antibodies are useful for detecting an APLP in a biological specimen.

A cDNA from a mouse brain library that encodes a protein whose predicted amino acid sequence is 42% identical and 64% similar to the amyloid precursor protein (APP) has been isolated. This 653 amino acid amyloid precursor-like protein (APLP1) is similar to APP in overall structure as well as amino acid sequence. The amino acid sequence homologies are concentrated within three distinct regions of the two proteins where the identities are 47%, 54% and 56%. All three regions are also strongly conserved in the Drosophila APP-like gene APPL (Rosen, et al., Proc. Natl. Acad. Sci. USA 86:2478-2482 (1989)).

Notably, twelve cysteine residues are conserved in the cysteine-rich region of both APLP1 and APLP2, as are an extracellular N-glycosylation site and the cytoplasmic clathrin-binding domain. The cytoplasmic domain is also conserved in a partial cDNA clone reported to encode an APP-like gene in rat testes (Yan et al. Proc. Natl. Acad. Sci. USA 87:2405-2408 (1990)). These data suggest that APP is part of a highly conserved gene family.

The present mouse APLP1 cDNA hybridizes to two messages of approximately 2.4 and 1.6 kb that are present in mouse brain and neuroblastoma cells.

APLP1 has been mapped to chromosome 19 within the same general region as the locus that has been reported to be linked to a form of late-onset familial Alzheimer's disease. Specifically, the APLP1 gene locus is located on the long arm of chromosome 19 between 19q13.2 and the centromere. In situ hybridization studies show APP and APLP to be expressed in similar human brain regions and cell sub-populations. APLP1 is a candidate gene for the gene defect for late-onset familial Alzheimer's disease.

Human APLP2 is encoded by a 706 amino acid sequence that is similar to APP and APLP1 in overall structure as well as amino acid sequence. APLP2 is 52% identical, 69% similar to APP695 and 43% identical, 63% similar to APLP1. Virtually all of the identified domains and motifs that characterize APP, APPL and APLP1 are present in APLP2. Specifically, an N-terminal cysteine-rich region (consisting of 12 cysteines), a novel zinc-binding motif (Bush et al., Neurobiol. Aging 13 (supplement 1):A.331 (1992)), an acidic-rich domain, N-glycosylation sites, a hydrophobic membrane spanning domain and a cytoplasmic domain containing a clathrin binding motif and potential serine/threonine, casine kinase I, II and tyrosine phosphorylation sites are conserved in APLP2. APLP1 and APLP2 are candidate genes for late onset familial Alzheimers disease.

DESCRIPTION OF THE FIGURES

FIG. 1. Schematic representation of the mouse APLP1 open reading frame and the relation of various cDNA clones. The 2361 base pair open reading frame and the non-coding region of the APLP1 cDNA are shown. Also shown are the relative locations of two representative cDNA clones found in 11 libraries, 69A and 1A, and a clone obtained through the RACE procedure, J. Restriction enzyme sites: E=EcoRI, P=PstI.

FIGS. 2A-2E. Nucleotide and amino acid sequence of the APLP1 cDNA. The composite nucleotide sequence (SEQ ID NO:2) and the predicted amino acid sequence (SEQ ID NO:3) of APLP1 is shown. The predicted membrane spanning region is underlined. The location of the primers that were used for the RACE procedures are indicated by arrowed lines over the nucleotide sequences. The location of the peptide sequence used for the production of antisera is double underlined. Predicted N-glycosylation sites are underlined with a squiggly line and a region surrounding a potential tyrosine phosphorylation site is underlined by dots. The polyadenylation signal is indicated by bold face type and the stop codon is shown by an asterisk.

FIGS. 3A-3B. Comparison of the APLP1 (SEQ ID NO:3) and APP (SEQ ID NO:5) amino acid sequences. The UWGCG Bestfit analysis of the mouse APLP1 and human APP 695 (Chen, et al., J. Biol. Chem. 265:3116-3123 (1990)) is shown. Identities are indicated by a vertical line between the two amino acids. Similarities are indicated by a single or double dot. Gaps produced by the Bestfit alignment are shown by dots in the sequence. The βA4-protein sequence is underlined in the APP sequence. The identities are concentrated in three regions: APLP amino acids 21-211, 316-488, and 609-654.

FIGS.4A-4B. Domains of homology. Regions of the amino acid sequences of the mouse amyloid precursor like protein (APLP1) (SEQ ID NOS: 6, 7, 8), the human amyloid precursor protein APP (SEQ ID NOS: 9, 10, 11), the Drosophila amyloid precursor-like protein (APPL1) (SEQ ID NOS: 11,12, 13) and the rat testis cDNA (testis) (SEQ ID NOS: 13, 14, 15) are compared. Amino acids that are identical in all of the sequences in the domain are shown as capital letters in bold face type and are identified by the presence of a vertical line (|) above the sequences. Amino acids that are the same in more than one sequence are shown as capital letters and have a dot (°) over the sequences. Amino acids that are not identical to any others are shown as lower case letters. The conserved cysteines are identified by the presence of a carrot underneath the sequence. Spans of particularly conserved amino acids are underlined. An N-glycosylation signal is identified by a double underline. Stop codons are indicated by an asterisk and the amino acid numbers of the sequences are shown at the beginning of each line.

FIG. 5. Northern blots of mouse brain and neuroblastoma RNA. Poly A+RNA (10 μg) from neuroblastoma (lane 1) and mouse brain (lane 2) was separated on agarose gels containing formaldehyde and transferred to nylon as described in Materials and Methods. The blot was probed with DNA corresponding to nucleotides 1482-1995 of the nucleotide sequence shown in FIGS. 2A-2E. Sizes of hybridizing messages in kb are indicated.

FIGS. 6A-6C. Western Blots using antiserum 301 (See the Example). Mouse brain and neuroblastoma proteins were separated by 7.5% PAGE as described in Materials and Methods.

(A) Mouse brain protein was probed with antiserum 301 or preimmune serum at a dilution of 1:100. The binding of antiserum 301 to the 65 and 33 kDa proteins is inhibited by the presence of increasing amounts of the peptide QQLRELQRH (SEQ ID NO:1), used to immunize the rabbit. Preimmune serum with no peptide (lane 1); immune serum with no peptide (lane 2); immune serum preabsorbed with 5 ng/ml peptide (lane 3); immune serum preabsorbed with 50 ng/ml of peptide (lane 4); immune serum preabsorbed with 500 ng/ml of peptide (lane 5). Preabsorption of immune serum with 500 ng of an irrelevant yeast β-tubulin peptide had no effect on the binding (lane 6).

(B) Neuroblastoma cell extracts probed with preimmune serum (lane 1) and 301 antiserum (lane 2). Both sera were used at a dilution of 1:100.

(C) Anti-peptide (QQLRELQRH (SEQ ID NO:1)) antiserum recognizes a β-galactosidase-APLP1 fusion protein. Western blots on bacterially produced proteins. Lanes 1-3 were stained with preimmune serum from rabbit 301. Lanes 4-6 were stained with immune serum. Lanes 1 and 4: induced (temperature sensitive induction and promoter) cells containing a plasmid with its β-galactosidase gene fused to an APLP1 cDNA fragment inappropriately oriented for production of an APLP1 epitope. Lanes 2 and 5: uninduced cells containing a plasmid with its β-galactosidase gene fused in frame to the APLP open reading frame. Lanes 3 and 6: same cells as in lanes 2 and 5 except induced. Induced cells were grown at 42° C. Uninduced cells were grown at 30° C. The arrowhead indicates a β-galactosidase-APLP1 fusion protein recognized by immune serum but not by preimmune serum. That protein is approximately 24 kDa larger than β-galactosidase alone, as predicted, due to the insertion of 222 additional residues of APLP1 open reading frame.

FIGS. 7A-7E. Immunofluorescence staining of mouse neuroblastoma cells with antiserum 301. Cells were stained with antiserum 301 at a dilution of 1:10,000 as described in Materials and Methods. FIG. 7A shows neuroblastoma cells stained with antiserum 301. FIG. 7B shows a higher magnification of a cell stained with antiserum 301 where the reticular pattern is evident. This staining pattern is similar to that seen when an antibody to a known Golgi enzyme (mannosidase II) is used to stain the cells (FIG. 7C). The perinuclear staining is competed by the addition of the peptide that was used as the antigen (FIG. 7D), and is not seen in the presence of preimmune serum (FIG. 7E). The magnification in FIG. 7A and FIGS. 7C-7E is 720× and in FIG. 7B 950×.

FIG. 8. Mapping of the APLP1 locus using a somatic cell hybrid panel. All hybrids have been previously described (Brook et al., Hum. Genet. 87:65-72 (1991); Chartier-Harlin et al., Nature 353:884-846 (1991); and Geissler et al., Cell Mol. Genet. 17:197-214 (1991)). The portions of chromosome 19 retained in each human-rodent cell hybrid are illustrated, and the names of the representative cell lines are shown above. The presence (+) or absence (-) of APLP1 in each hybrid cell line is indicated.

FIGS. 9A-9B. Comparison of APLP2 (SEQ ID NO:4) and APP (SEQ ID NO:5) animo acid sequences. An alignment of the human APLP2 amino acid sequence and the human APP695 (Kang et al., Nature 325:733-736 (1987)) was generated using the UWGCG GAP analysis. Gaps produced by the alignment are indicated by dots in the sequence. The location of the four PCR primers that were used to generate the SG190 probe (see Example 3) are indicated by arrows above the amino acid sequence. Twelve conserved cysteines are indicated by carets ( ) under the sequence and a zinc-binding motif is indicated by a double underline. A conserved acidic-rich region is located between APLP2 amino acids 216 and 278. An N-glycosylation signal is underlined, an alternatively spliced exon is overlined, predicted transmembrane regions are shown in italics, and a clathrin binding motif is indicated by bold face type. Potential phosphorylation sites are indicated by a # sign (protein kinase C), a sign (caseine kinase I and II) or a ° sign (tyrosine kinase) over the sequence. Stop codons are indicated by an asterisk.

FIGS. 10A-10B. Alignment of the amino acid sequences of the members of the APP-gene family. The sequences of human APP (SEQ ID NO:5) human APLP2 (SEQ ID NO:4), mouse APLP1 (SEQ ID NO: 3) (Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992)) are presented as aligned by the UWGCG PILEUP program. Amino acids that are not identical or conservatively substituted are indicated by a dash. Gaps in the amino acid sequence that were created by the alignment are shown by dots. The predicted initiator methionine for each protein is shown and stop codons are indicated by an asterisk.

FIGS. 11A-11B. Distribution of human APLP2 gene transcripts. RNA from fetal human tissues (11A) or adult human brain (11B) was isolated, fractionated, transferred to nylon membranes, and hybridized with radiolabeled probe as previously described (Tanzi et al., Science 235:880-884 (1987)). (11A) Hybridization of a PCR fragment corresponding to APLP2 amino acids 327 to 490 (APLP2) or a 3'1.1 kb EcoRI APP cDNA fragment (FB63) were hybridized to RNA (20 μg) from human 20-22 week aborted fetal tissue obtained midtrimester under protocols approved by the institutional review board at Brigham and Women's Hospital. (11B) Hybridization of the APLP2 PRC fragment or FB63 to RNA (10 μg) from adult human brain subregions: A10, frontal cortex; A17, striate cortex; A18, extrastriate cortex, A20, 21 temporal association cortex; A4, motor cortex; thalamus-VPL, thalamus-ventral posterolateral nucleus; A40, posterior perisylvian cortex-supramarginal gyri; A44, anterior perisylvian cortex-opercular gyri. Shown beneath FIG. 11B is a control hybridization with a glyceraldehyde-3-phosphate dehydrogenase cDNA (G3PD). The two autoradiograms are from independent hydribizations to the same filter.

FIG. 12. Northern blot of APLP2 to total RNA from normal and Down syndrome brains, adult normal and ADA cerebellum and frontal cortex. A PCR generated fragment corresponding to APLP2 amino acid 327 to 490 and FB63 (APP) were hybridized to total RNA (25 μg) from 19-week normal (N) and Down syndrome (DS) brains, adult normal (N Cb) cerebellum, and adult normal (N FCtx) and AD (AD FCtx) frontal cortex. The two autoradiograms are from independent hybridizations to the same filter.

FIGS. 13A-13B. Nonisotopic in situ localization of APLP2-oligonucleotide. (A) In situ hybridization using a 45-mer specific for APLP2 (corresponding to amino acids 74-88 in FIGS. 9A-9B) reveals staining of CA1 pyramidal neurons. The probe was end-labeled with biotin-21-dUTP using 3' terminal transferase and visualized by avidin-biotin-peroxidase reaction (Tanzi et al., Mol. Brain Res.: in press; Hyman et al., Mol. Brain Res.: in press; Wasco et al., Alzheimer's disease and related disorders 1992: selected communications (in press)). (B) a negative control 45-mer corresponding to the other strand of the same region of APLP2 shows no significant staining. Magnification=16×0.3.

DETAILED DESCRIPTION OF THE INVENTION

I. Definitions

To aid in the understanding of the specification and claims, including the scope to be given such terms, the following definitions are provided.

Gene. By the term "gene" is intended a DNA sequence which encodes through its template or messenger RNA a sequence of amino acids characteristic of a specific peptide. Further, the term includes intervening, non-coding regions, as well as regulatory regions, and can include 5' and 3' ends.

Gene sequence. The term "gene sequence" is intended to refer generally to a DNA molecule. As used herein, the term is meant to include both a DNA molecule which contains one or more genes, or gene fragments, as well as a DNA molecule which contains a non-transcribed or non-translated sequence. The term is further intended to include any combination of gene(s), gene fragment(s), non-transcribed sequence(s) or non-translated sequence(s) which are present on the same DNA molecule.

The present sequences may be derived from a variety of sources including DNA, cDNA, synthetic DNA, RNA, or combinations thereof. Such gene sequences may comprise genomic DNA which may or may not include naturally occurring introns. Moreover, such genomic DNA may be obtained in association with promoter regions or poly A sequences. The gene sequences, genomic DNA or cDNA may be obtained in any of several ways. Genomic DNA can be extracted and purified from suitable cells, such as brain cells, by means well known in the art. Alternatively, mRNA can be isolated from a cell and used to produce cDNA by reverse transcription or other means.

cDNA. The term "cDNA" includes genes from which the intervening sequences have been removed.

Recombinant DNA. By the term "recombinant DNA" is meant a molecule that has been recombined by in vitro splicing cDNA or a genomic DNA sequence.

Cloning Vehicle. A plasmid or phage DNA or other DNA sequence which is able to replicate in a host cell. The cloning vehicle is characterized by one or more endonuclease recognition sites at which such DNA sequences may be cut in a determinable fashion without loss of an essential biological function of the DNA, which may contain a marker suitable for use in the identification of transformed cells. Markers include, for example, tetracycline resistance or ampicillin resistance. The word vector can be used to connote a cloning vehicle.

Expression Control Sequence. A sequence of nucleotides that controls or regulates expression of structural genes when operably linked to those genes. They include the lac systems, the trp system major operator and promoter regions of the phage lambda, the control region of fd coat protein and other sequences known to control the expression of genes in prokaryotic or eukaryotic cells.

Expression vehicle. A vehicle or vector similar to a cloning vehicle but which is capable of expressing a gene which has been cloned into it, after transformation into a host. The cloned gene is usually placed under the control of (i.e., operably linked to) certain control sequences such as promoter sequences. Expression control sequences will vary depending on whether the vector is designed to express the operably linked gene in a prokaryotic or eukaryotic host and may additionally contain transcriptional elements such as enhancer elements, termination sequences, tissue-specificity elements, and/or translational initiation and termination sites.

Functional Derivative. A "functional derivative" of APLP is a protein which possesses a biological activity (either functional or structural) or immunological characteristics that are substantially similar to a biological activity or immunological characteristics of non-recombinant APLP. A functional derivative of APLP may or may not contain post-translational modifications such as covalently linked carbohydrate, depending on the necessity of such modifications for the performance of a specific function. The term "functional derivative" is intended to include the "fragments," "variants," "analogues," "homologues," or "chemical derivatives" of a molecule.

Fragment. A "fragment" of a molecule such as APLP is meant to refer to any variant of the molecule.

Variant. A "variant" of a molecule such as APLP is meant to refer to a molecule substantially similar in structure and biological activity or immunological characteristics to either the entire molecule, or to a fragment thereof. Thus, provided that two molecules possess a similar activity, they are considered variants as that term is used herein even if the composition or secondary, tertiary, or quaternary structure of one of the molecules is not identical to that found in the other, or if the sequence of amino acid residues is not identical.

Analog. An "analog" of a molecule such as APLP is meant to refer to a molecule substantially similar in function to either the entire molecule or to a fragment thereof. As used herein, a molecule is said to be a "chemical derivative" of another molecule when it contains additional chemical moieties not normally a part of the molecule. Such moieties may improve the molecule's solubility, absorption, biological half life, etc. The moieties may alternatively decrease the toxicity of the molecule, eliminate or attenuate any undesirable side effect of the molecule, etc. Moieties capable of mediating such effects are disclosed in Remington's Pharmaceutical Sciences (1980). Procedures for coupling such moieties to a molecule are well known in the art.

Operator. A DNA sequence capable of interacting with the specific repressor, thereby controlling the transcription of adjacent gene(s).

Promoter. The term "promoter" is intended to refer to a DNA sequence which can be recognized by an RNA polymerase. The presence of such a sequence permits the RNA polymerase to bind and initiate transcription of operably linked gene sequences.

Promoter region. The term "promoter region" is intended to broadly include both the promoter sequences as well as all gene sequences which may be necessary for the initiation of transcription. The presence of a promoter region is, therefore, sufficient to cause the expression of an operably linked gene sequence.

Operably Linked. As used herein, the term "operably linked" means that the promoter controls the initiation of expression of the gene. A promoter is operably linked to a sequence of proximal DNA if upon introduction into a host cell the promoter determines the transcription of the proximal DNA sequence or sequences into one or more species of RNA. A promoter is operably linked to a DNA sequence if the promoter is capable of initiating transcription of that DNA sequence.

Prokaryote. The term "prokaryote" is meant to include all organisms without a true nucleus, including bacteria.

Host. The term "host" is meant to include not only prokaryotes, but also such eukaryotes as yeast and filamentous fungi, as well as plant and animal cells. The term includes any organism or cell that is the recipient of a replicable expression vehicle.

Amyloid Precursor-Like Protein (APLP). This term is meant to include any amyloid precursor-like protein, from any species, including APLP1 and APLP2, especially from human brain, Alzheimer's disease human brain, or a synthetic APLP. The present APLP exhibits at least 40% identity at the amino acid level to APP and/or APLP1 and/or APLP2, more preferably at least 50% identity thereto, and contains an N-terminal cysteine-rich region consisting of at least 10 cysteines, more preferably consists of 12 cysteines. The term is also used in this invention to include any analog, homolog, mutant or derivative of a naturally occurring APLP. The term is also meant to include fragments having less than the naturally occurring number of amino acids, such as partial fragments of natural or synthetic APLP which retain the biological or immunological characteristics of the polypeptides specifically disclosed in this application. The term is also used to include any product which comprises the sequence of a naturally occurring APLP, or analog or homolog thereof, together with one or more flanking amino acids, which still have the same biological or immunological characteristics.

The term is also used to include any peptide which comprises the sequence of a naturally-occurring APLP or an analog thereof together with one or more flanking amino acids, which have the same biological (functional or structural) or immunological characteristics.

The present invention pertains both to expression of full-length APLP's (i.e. APLP1 and APLP2), and to the functional derivatives of these proteins. This term also includes genetic alleles of APLP's including APLP1 and APLP2.

Substantially homologous. The term "substantially homologous" as used herein refers to the ability of a first DNA sequence encoding an APLP to hybridize to a second DNA sequence encoding an APLP under stringent conditions, for example, at about 0.1× sodium citrate sodium chloride buffer (SSC) at a temperature of about 65° C. For example, if an APLP variant is substantially homologous to a mouse APLP, a DNA sequence encoding the APLP variant is capable of hybridizing to a DNA sequence encoding the mouse APLP under stringent conditions.

Substantially pure. The term "substantially pure" means that the protein/molecule is essentially free from any other detectable biological constituents.

Animal The term "animal" is meant to refer to any living creature including mammals, for example, humans.

Having established the amino acid sequence of both APLP1 and APLP2, a nucleotide probe can be constructed which is complementary to the DNA, or mRNA coding for APLP1 or APLP2 or a fragment thereof. This probe can be used as a diagnostic test to determine the presence of APLPs including APLP1 and APLP2.

II. Genetic Engineering of an APLP

This invention comprises the amino acid sequence of APLP1 and APLP2, genetic sequences coding for APLP1 and for APLP2 mRNA, expression vehicles containing the genetic sequences, hosts transformed therewith, recombinant APLP1 and recombinant APLP2,and antisense RNA produced by such transformed host expression. The invention further comprises antibodies directed against APLP1 and APLP2.

The process for genetically engineering APLP1 and APLP2 sequences, according to the invention, is facilitated through the cloning of genetic sequences which are capable of encoding the peptide and through the expression of such genetic sequences. As used herein, the term "genetic sequences" is intended to refer to a nucleic acid molecule (preferably DNA). Genetic sequences which are capable of encoding the present APLP proteins are derived from a variety of sources. These sources include genomic DNA, cDNA, synthetic DNA, and combinations thereof. The preferred source of the genomic DNA or mRNA is brain or neuroblastoma cells. Post mortem RNA procedures can be followed to isolate the RNA. See Sajdel-Sulkowska et al., J. Neurochem. 40:670-680 (1983). The mRNA may then be used to obtain cDNA by techniques known to those skilled in the art. Probes may be synthesized based on the known amino acid sequence of the present APLP proteins (APLP1 and APLP2) by methods known in the art.

The APLP genomic DNA of the invention may or may not include naturally occurring introns. Moreover, such genomic DNA may be obtained in association with the 5' promoter region of the APLP gene sequences and/or with the 3' transcriptional termination region. Further, such genomic DNA may be obtained in association with the genetic sequences which encode the 5' non-translated region of the APLP mRNA and/or with the genetic sequences which encode the 3' non-translated region. To the extent that a host cell can recognize the transcriptional and/or translational regulatory signals associated with the expression of the mRNA and protein, then the 5' and/or 3' non-transcribed regions of the native gene, and/or, the 5' and/or 3' non-translated regions of the mRNA, may be retained and employed for transcriptional and translational regulation. APLP genomic DNA can be extracted and purified from any brain cell of an animal, including, for example, mice and humans, or from any cell containing human chromosome 19, or from any cell that expresses APLP by means well known in the art (for example, see Guide to Molecular Cloning Techniques, S. L. Berger et al., eds., Academic Press (1987)).

Alternatively, APLP mRNA can be isolated from any cell which produces or expresses APLP, and used to produce cDNA by means well known in the art (for example, see Guide to Molecular Cloning Techniques, S. L. Berger et al., eds., Academic Press (1987)). Preferably, the mRNA preparation used will be enriched in mRNA coding for APLP, either naturally, by isolation from cells which are producing large amounts of the protein, or in vitro, by techniques commonly used to enrich mRNA preparations for specific sequences, such as sucrose gradient centrifugation, or both.

For cloning into a vector, such suitable DNA preparations (either human genomic DNA or cDNA) are randomly sheared or enzymatically cleaved, respectively, and ligated into appropriate vectors to form a recombinant gene (either genomic or cDNA) library. A DNA sequence encoding APLP or its functional derivatives may be inserted into a DNA vector in accordance with conventional techniques, including blunt-ending or staggered-ending termini for ligation, restriction enzyme digestion to provide appropriate termini, filling in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and ligation with appropriate ligases. Techniques for such manipulations are disclosed by Maniatis et al., (In: Molecular cloning, A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, N.Y., (1982)), and are well known in the art.

Libraries containing APLP clones may be screened and an APLP clone identified by any means which specifically selects for APLP DNA such as, for example, a) by hybridization with an appropriate nucleic acid probe(s) containing a sequence specific for the DNA of this protein, or b) by hybridization-selected translational analysis in which native mRNA which hybridizes to the clone in question is translated in vitro and the translation products are further characterized, or, c) if the cloned genetic sequences are themselves capable of expressing mRNA, by immunoprecipitation of a translated APLP product produced by the host containing the clone.

Oligonucleotide probes specific for APLP which can be used to identify clones to this protein can be designed from knowledge of the amino acid sequence of APLP1 or APLP2. The sequence of amino acid residues in a peptide is designated herein either through the use of their commonly employed three-letter designations or by their single-letter designations. A listing of these three-letter and one-letter designations may be found in textbooks such as Biochemistry, Lehninger, A., Worth Publishers, New York, N.Y. (1970). When the amino acid sequence is listed horizontally, the amino terminus is intended to be on the left end whereas the carboxy terminus is intended to be at the right end. The residues of amino acids in a peptide may be separated by hyphens. Such hyphens are intended solely to facilitate the presentation of a sequence.

Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid (Watson, J. D., In: Molecular Biology of the Gene, 3rd Ed., W. A. Benjamin, Inc., Menlo Park, Calif. (1977), pp. 356-357). The peptide fragments are analyzed to identify sequences of amino acids which may be encoded by oligonucleotides having the lowest degree of degeneracy. This is preferably accomplished by identifying sequences that contain amino acids which are encoded by only a single codon.

Although occasionally an amino acid sequence may be encoded by only a single oligonucleotide sequence, frequently the amino acid sequence may be encoded by any of a set of similar oligonucleotides. Importantly, whereas all of the members of this set contain oligonucleotide sequences which are capable of encoding the same peptide fragment and, thus, potentially contain the same oligonucleotide sequence as the gene which encodes the peptide fragment, only one member of the set contains the nucleotide sequence that is identical to the exon coding sequence of the gene. Because this member is present within the set, and is capable of hybridizing to DNA even in the presence of the other members of the set, it is possible to employ the unfractionated set of oligonucleotides in the same manner in which one would employ a single oligonucleotide to clone the gene that encodes the peptide.

Using the genetic code (Watson, J. D., In: Molecular Biology of the Gene, 3rd Ed., W. A. Benjamin, Inc., Menlo Park, Calif. (1977)), one or more different oligonucleotides can be identified from the amino acid sequence, each of which would be capable of encoding the APLP. The probability that a particular oligonucleotide will, in fact, constitute the actual APLP coding sequence can be estimated by considering abnormal base pairing relationships and the frequency with which a particular codon is actually used (to encode a particular amino acid) in eukaryotic cells. Such "codon usage rules" are disclosed by Lathe et al., J. Molec. Biol. 183:1-12 (1985). Using the "codon usage rules" of Lathe, a single oligonucleotide sequence, or a set of oligonucleotide sequences, that contain a theoretical "most probable" nucleotide sequence capable of encoding the APLP sequences is identified.

The suitable oligonucleotide, or set of oligonucleotides, which is capable of encoding a fragment of an APLP gene (or which is complementary to such an oligonucleotide, or set of oligonucleotides) may be synthesized by means well known in the art (see, for example, Synthesis and Application of DNA and RNA, S. A. Narang, ed., 1987, Academic Press, San Diego, Calif.) and employed as a probe to identify and isolate the cloned APLP gene by techniques known in the art. Techniques of nucleic acid hybridization and clone identification are disclosed by Maniatis et al. (In: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, N.Y. (1982)); Berger et al., (In: Guide to Molecular Cloning Techniques, Academic Press (1988)); Sambrook et al., (In: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y., 2d ed. (1989); and by Hames et al. (In: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985)), which references are herein incorporated by reference. Those members of the above-described gene library which are found to be capable of such hybridization are then analyzed to determine the extent and nature of the APLP encoding sequences which they contain.

To facilitate the detection of the desired APLP DNA encoding sequence, the above-described DNA probe is labeled with a detectable group. Such detectable group can be any material having a detectable physical or chemical property. Such materials have been well-developed in the field of nucleic acid hybridization and in general most any label useful in such methods can be applied to the present invention. Particularly useful are radio- active labels, such as ³² P, ³ H, ¹⁴ C, ³⁵ S, ¹²⁵ I, or the like. Any radioactive label may be employed which provides for an adequate signal and has a sufficient half-life. The oligonucleotide may be radioactively labeled, for example, by "nick-translation" by well-known means, as described in, for example, Rigby et al., J. Mol. Biol. 113:237 (1977) and by T4 DNA polymerase replacement synthesis as described in, for example, Deen et al., Anal. Biochem. 135:456 (1983).

Alternatively, polynucleotides are also useful as nucleic acid hybridization probes when labeled with a non-radioactive marker such as biotin, an enzyme or a fluorescent group. See, for example, Leary et al., Proc. Natl. Acad. Sci. USA 80:4045 (1983); Renz et al., Nucl. Acids Res. 12:3435 (1984); and Renz, M., EMBO J. 6:817 (1983).

Thus, in summary, the actual identification of APLP sequences permits the identification of a theoretical "most probable" DNA sequence, or a set of such sequences, capable of encoding such a peptide. By constructing an oligonucleotide complementary to this theoretical sequence (or by constructing a set of oligonucleotides complementary to the set of "most probable" oligo-nucleotides), one obtains a DNA molecule (or set of DNA molecules), capable of functioning as a probe(s) for the identification and isolation of clones containing an APLP (i.e. APLP1 or APLP2) gene.

In an alternative way of cloning an APLP gene, a library is prepared using an expression vector, by cloning DNA or, more preferably cDNA prepared from a cell capable of expressing an APLP, into an expression vector. The library is then screened for members which express the APLP, for example, by screening the library with antibodies to the protein.

The above discussed methods are, therefore, capable of identifying genetic sequences which are capable of encoding an APLP or fragments of this protein. In order to further characterize such genetic sequences, and, in order to produce the recombinant protein, it is desirable to express the proteins which these sequences encode. Such expression identifies those clones which express proteins possessing characteristics of an APLP. Such characteristics may include the ability to specifically bind an APLP antibody and the ability to elicit the production of antibody which are capable of binding to an APLP.

III. Expression of an APLP and Its Functional Derivatives

To express an APLP, transcriptional and translational signals recognizable by an appropriate host are necessary. The cloned APLP encoding sequences, obtained through the methods described above, and preferably in a double-stranded form, may be operably linked to sequences controlling transcriptional expression in an expression vector, and introduced into a host cell, either prokaryote or eukaryote, to produce recombinant APLP or a functional derivative thereof. Depending upon which strand of the APLP encoding sequence is operably linked to the sequences controlling transcriptional expression, it is also possible to express APLP antisense RNA or a functional derivative thereof.

Expression of the APLP in different hosts may result in different post-translational modifications which may alter the properties of the APLP. The present invention encompasses the expression of the APLP, or a functional derivative thereof, in eukaryotic cells, and especially mammalian, insect and yeast cells. Especially preferred eukaryotic hosts are mammalian cells either in vivo, in animals or in tissue culture. Mammalian cells provide post-translational modifications to recombinant APLP which include folding and/or glycosylation at sites similar or identical to that found for a native APLP. Most preferably, mammalian host cells include brain and neuroblastoma cells.

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a polypeptide if it contains expression control sequences which contain transcriptional regulatory information and such sequences are "operably linked" to the nucleotide sequence which encodes the polypeptide.

An operable linkage is a linkage in which a sequence is connected to a regulatory sequence (or sequences) in such a way as to place expression of the sequence under the influence or control of the regulatory sequence. Two DNA sequences (such as an APLP encoding sequence and a promoter region sequence linked to the 5' end of the encoding sequence) are said to be operably linked if induction of promoter function results in the transcription of the APLP encoding sequence mRNA and if the nature of the linkage between the two DNA sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the expression regulatory sequences to direct the expression of the APLP mRNA, antisense RNA, or protein, or (3) interfere with the ability of the APLP template to be transcribed by the promoter region sequence. Thus, a promoter region would be operably linked to a DNA sequence if the promoter were capable of effecting transcription of that DNA sequence.

The precise nature of the regulatory regions needed for gene expression may vary between species or cell types, but shall in general include, as necessary, 5' non-transcribing and 5' non-translating (non-coding) sequences involved with initiation of transcription and translation respectively, such as the TATA box, capping sequence, CAAT sequence, and the like. Especially, such 5' non-transcribing control sequences will include a region which contains a promoter for transcriptional control of the operably linked gene.

Expression of the APLP in eukaryotic hosts requires the use of regulatory regions functional in such hosts, and preferably eukaryotic regulatory systems. A wide variety of transcriptional and translational regulatory sequences can be employed, depending upon the nature of the eukaryotic host. The transcriptional and translational regulatory signals can also be derived from the genomic sequences of viruses which infect eukaryotic cells, such as adenovirus, bovine papilloma virus, Simian virus, herpes virus, or the like. Preferably, these regulatory signals are associated with a particular gene which is capable of a high level of expression in the host cell.

In eukaryotes, where transcription is not linked to translation, such control regions may or may not provide an initiator methionine (AUG) codon, depending on whether the cloned sequence contains such a methionine. Such regions will, in general, include a promoter region sufficient to direct the initiation of RNA synthesis in the host cell. Promoters from heterologous mammalian genes which encode mRNA product capable of translation are preferred, and especially, strong promoters such as the promoter for actin, collagen, myosin, etc., can be employed provided they also function as promoters in the host cell. Preferred eukaryotic promoters include the promoter of the mouse metallothionein I gene (Hamer et al., J. Mol. Appl. Gen. 1:273-288 (1982)); the TK promoter of Herpes virus (McKnight, S., Cell 31:355-365 (1982)); the SV40 early promoter (Benoist et al., Nature (London) 290:304-310 (1981)); in yeast, the yeast gal4 gene promoter (Johnston et al., Proc. Natl. Acad. Sci. USA 79:6971-6975 (1982); Silver et al., Proc. Natl. Acad. Sci. USA 81:5951-5955 (1984)) or a glycolytic gene promoter may be used.

As is widely known, translation of eukaryotic mRNA is initiated at the codon which encodes the first methionine. For this reason, it is preferable to ensure that the linkage between a eukaryotic promoter and a DNA sequence which encodes an APLP, or a functional derivative thereof, does not contain any intervening codons which are capable of encoding a methionine. The presence of such codons results either in a formation of a fusion protein (if the AUG codon is in the same reading frame as the APLP encoding DNA sequence) or a frame-shift mutation (if the AUG codon is not in the same reading frame as the APLP encoding sequence).

If desired, a fusion product of an APLP may be constructed. For example, the sequence coding for APLP may be linked to a signal sequence which will allow secretion of the protein from, or the compartmentalization of the protein in, a particular host. Such signal sequences may be designed with or without specific protease sites such that the signal peptide sequence is amenable to subsequent removal. Alternatively, the native signal sequence for this protein may be used.

Transcriptional initiation regulatory signals can be selected which allow for repression or activation, so that expression of the operably linked genes can be modulated. Of interest are regulatory signals which are temperature-sensitive so that by varying the temperature, expression can be repressed or initiated, or are subject to chemical regulation, e.g., metabolite. Also of interest are constructs wherein APLP mRNA and antisense RNA are provided in a transcribable form, but with different promoters or other transcriptional regulatory elements such that induction of APLP mRNA expression is accompanied by repression of antisense RNA expression, and/or repression of APLP mRNA expression is accompanied by induction of antisense RNA expression.

Translational signals are not necessary when it is desired to express APLP antisense RNA sequences.

If desired, the non-transcribed and/or non-translated regions 3' to the sequence coding for APLP can be obtained by the above-described cloning methods. The 3'-non-transcribed region may be retained for its transcriptional termination regulatory sequence elements; the 3-non-translated region may be retained for its translational termination regulatory sequence elements, or for those elements which direct polyadenylation in eukaryotic cells. Where the native expression control sequence signals do not function satisfactorily host cell, then sequences functional in the host cell may be substituted.

The vectors of the invention may further comprise other operably linked regulatory elements such as enhancer sequences, or DNA elements which confer tissue or cell-type specific expression on an operably linked gene.

To transform a mammalian cell with the DNA constructs of the invention many vector systems are available, depending upon whether it is desired to insert an APLP DNA construct into the host cell chromosomal DNA, or to allow it to exist in an extrachromosomal form.

If the APLP DNA encoding sequence and an operably linked promoter is introduced into a recipient eukaryotic cell as a non-replicating DNA (or RNA) molecule, which may either be a linear molecule or a closed covalent circular molecule which is incapable of autonomous replication, the expression of the APLP may occur through the transient expression of the introduced sequence.

Genetically stable transformants may be constructed with vector systems, or transformation systems, whereby APLP DNA is integrated into the host chromosome. Such integration may occur de novo within the cell or, in a most preferred embodiment, be assisted by transformation with a vector which functionally inserts itself into the host chromosome, for example, with retroviral vectors, transposons or other DNA elements which promote integration of DNA sequences in chromosomes. A vector is employed which is capable of integrating the desired gene sequences into a mammalian host cell chromosome.

Cells which have stably integrated the introduced DNA into their chromosomes are selected by also introducing one or more markers which allow for selection of host cells which contain the expression vector in the chromosome, for example, the marker may provide biocide resistance, e.g., resistance to antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene can either be directly linked to the DNA gene sequences to be expressed, or introduced into the same cell by co-transfection.

In another embodiment, the introduced sequence is incorporated into a plasmid or viral vector capable of autonomous replication in the recipient host. Any of a wide variety of vectors may be employed for this purpose, as outlined below.

Factors of importance in selecting a particular plasmid or viral vector include: the ease with which recipient cells that contain the vector may be recognized and selected from those recipient cells which do not contain the vector; the number of copies of the vector which are desired in a particular host; and whether it is desirable to be able to "shuttle" the vector between host cells of different species.

Preferred eukaryotic plasmids include those derived from the bovine papilloma virus, vaccinia virus, SV40, and, in yeast, plasmids containing the 2-micron circle, etc., or their derivatives. Such plasmids are well known in the art (Botstein et al., Miami Wntr. Symp. 19:265-274 (1982); Broach, J. R., In: The Molecular Biology of the Yeast Saccharomyces: Life Cycle and Inheritance, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., pp. 445-470 (1981); Broach, J. R., Cell 28:203-204 (1982); Bollon et al., J. Clin. Hematol. Oncol. 10:39-48 (1980); Maniatis, T., In: Cell Biology: A Comprehensive Treatise, Vol. 3, "Gene Expression," Academic Press, N.Y., pp. 563-608 (1980)), and are commercially available. For example, mammalian expression vector systems which utilize the MSV-LTR promoter to drive expression of the cloned gene, and in which it is possible to cotransfect with a helper virus to amplify plasmid copy number, and integrate the plasmid into the chromosomes of host cells have been described (Perkins et al., Mol. Cell Biol. 3:1123 (1983); Clontech, Palo Alto, Calif.).

Once the vector or DNA sequence containing the construct(s) is prepared for expression, the DNA construct(s) is introduced into an appropriate host cell by any of a variety of suitable means, including transfection. After the introduction of the vector, recipient cells are grown in a selective medium, which selects for the growth of vector-containing cells. Expression of the cloned gene sequence(s) results in the production of an APLP, or in the production of a fragment of this protein. This expression can take place in a continuous manner in the transformed cells, or in a controlled manner, for example, expression which follows induction of differentiation of the transformed cells (for example, by administration of bromodeoxyuracil to neuroblastoma cells or the like).

The expressed protein is isolated and purified in accordance with conventional conditions, such as extraction, precipitation, chromatography, affinity chromatography, electrophoresis, or the like.

The APLP DNA encoding sequences, obtained through the methods above, will provide sequences which, by definition, encode an APLP and which may then be used to obtain APLP antisense RNA genetic sequences as the antisense RNA sequence will be that sequence found on the opposite strand of the strand transcribing the peptide core's mRNA. The antisense DNA strand may also be operably linked to a promoter in an expression vector such that transformation with this vector results in a host capable of expression of an APLP antisense RNA in the transformed cell. Antisense RNA and its expression may be used to interact with an endogenous APLP DNA or RNA in a manner which inhibits or represses transcription or translation of an APLP gene in a highly specific manner. Use of antisense RNA probes to block gene expression is discussed in Lichtenstein, C., Nature 333:801-802 (1988). For example, such probes can be used to block the expression of an APLP when the expression is aberrant.

IV. Construction and Identification of Antibodies to an APLP

In the following description, reference will be made to various methodologies well-known to those skilled in the art of immunology. Standard reference works setting forth the general principles of immunology include the work of Catty, D., (Antibodies, A Practical Approach, Vol. 1,IRL Press, Washington, D.C. (1988)); Klein, J., (Immunology: The Science of Cell-Noncell Discrimination, John Wiley & Sons, New York (1982)); Kennett et al., (Monoclonal Antibodies, Hybridoma: A New Dimension in Biological Analyses, Plenum Press, New York (1980)); Campbell, A. ("Monoclonal Antibody Technology," In: Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 13 (Burdon, R., et al., eds.), Elsevier, Amsterdam (1984)); and Eisen, H. N. (In: Microbiology, 3rd ed. (Davis, B. D., et al., Harper & Row, Philadelphia (1980)).

An antibody is said to be "capable of binding" a molecule if it is capable of specifically reacting with the molecule to thereby bind the molecule to the antibody. The term "epitope" is meant to refer to that portion of a hapten which can be recognized and bound by an antibody. An antigen may have one, or more than one epitope. An "antigen" is capable of inducing an animal to produce antibody capable of binding to an epitope of that antigen. The specific reaction referred to above is meant to indicate that the antigen will react, in a highly selective manner, with its corresponding antibody and not with the multitude of other antibodies which may be evoked by other antigens.

The term "antibody" (Ab) or "monoclonal antibody" (Mab) as used herein is meant to include intact molecules as well as fragments thereof (such as, for example, Fab and F(ab')₂ fragments) which are capable of binding an antigen. Fab and F(ab')₂ fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, and may have less non-specific tissue binding of an intact antibody (Wahl et al., J. Nucl. Med. 24:316-325 (1983)).

The antibodies of the present invention have specificity to one or more epitopes present on an APLP (i.e. APLP1 or APLP2) peptide. The antibodies of the invention can be polyclonal or monoclonal, provided that they are made with the present polypeptide or fragment thereof as an immunogen. Both of these types of antibodies can be utilized in the multiple applications described herein below.

The present antibodies can be used to detect the presence of an APLP in a biological specimen such as body fluid, serum, or a tissue sample. An APLP can be detected by contacting the sample with an imaging-effective amount of the present detectably labeled antibody and detecting the label, thereby establishing the presence of the APLP in the sample. The sample can be a tissue section, serum, or other biological specimen, or detection can be carried out by imaging in vivo. An APLP can also be detected by known immunoassay techniques, including, for example, RIA, ELISA, etc., using the present antibodies.

The antibodies of the present invention are prepared by any of a variety of methods. For example, cells expressing an APLP, or a fragment thereof, can be administered to an animal in order to induce the production of sera containing polyclonal antibodies that are capable of binding the APLP. For example, an APLP fragment is chemically synthesized and purified by HPLC to render it substantially free of contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal antisera of high specific activity.

Polyclonal antibodies can be generated n any suitable animal including, for example, mice, rabbits or goats. The present peptide can be injected by itself or linked to appropriate immunoactivating carriers, such as Keyhole's limpet hemocyanin (KLH). See Antibodies, A Practical Handbook, Vols. I and II, D. Catty, ed., IRL Press, Washington, D.C. (1988).

Monoclonal antibodies can be prepared in various ways using techniques well understood by those having ordinary skill in the art. For example, monoclonal antibodies can be prepared using hybridoma technology (Kohler et al., Nature 256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J. Immunol. 6:292 (1976); Hammerling et al., In: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.Y., pp. 563-681 (1981)); Monoclonal Antibodies--Hybridomas: A NewDimension in Biological Analysis, edited by Roger H. Kennett et al., published by Plenum Press (1980). In general, such procedures involve immunizing an animal with an APLP antigen. The splenocytes of such animals are extracted and fused with a suitable myeloma cell line. Any suitable myeloma cell line may be employed in accordance with the present invention. After fusion, the resulting hybridoma cells are selectively maintained in HAT medium, and then cloned by limiting dilution as described by Wands et al., Gastroenterology 80:225-232 (1981), which reference is herein incorporated by reference. The hybridoma cells obtained through such a selection are then assayed to identify clones which secrete antibodies capable of binding the APLP antigen.

Through application of the above-described methods, additional cell lines capable of producing antibodies which recognize epitopes of an APLP can be obtained.

For example, additional hybridomas which produce monoclonal antibodies which enable the detection of an APLP can be easily produced and isolated with minimal screening. Hybridomas producing monoclonal antibodies specific for epitopes which are found on an APLP are most effectively produced by first immunizing an animal from which hybridomas can be produced such as, for example, a Balb/c mouse, with initial subcutaneous injections of Freund's adjuvant, followed by booster injections within a few days. The fusion can be carried out using any of the techniques commonly known to those of ordinary skill in the art. The screening of the hybridomas to determine which ones are producing monoclonal antibodies specific for the present peptide is straightforward and can be done either in a standard ELISA or RIA format. For example, in an RIA screening format the culture supernatant, or ascites fluid from a hybridoma producing monoclonal antibody is reacted with ¹²⁵ I-peptide.

Of special interest to the present invention are antibodies which are produced in humans, or are "humanized" (i.e. non-immunogenic in a human) by recombinant or other technology such that they will not be antigenic in humans, or will be maintained in the circulating serum of a recipient for a longer period of time.

Humanized antibodies may be produced, for example by replacing an immunogenic portion of an antibody with a corresponding, but non-immunogenic portion (i.e. chimeric antibodies) (Robinson et al., International Patent Publication PCT/US86/02269; Akira et al., European Patent Application 184,187; Taniguchi, M., European Patent Application 171,496; Morrison et al., European Patent Application 173,494; Neuberger et al., PCT Application WO 86/01533; Cabilly et al., European Patent Application 125,023; Better et al., Science 240:1041-1043 (1988); Liu et al., Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); Liu et al., J. Immunol. 139:3521-3526 (1987); Sun et al., Proc. Natl. Acad. Sci. USA 84:214-218 (1987); Nishimura et al., Canc. Res. 47:999-1005 (1987); Wood et al., Nature 314:446-449 (1985)); Shaw et al., J. Natl. Cancer Inst. 80:1553-1559 (1988). General reviews of "humanized" chimeric antibodies are provided by Morrison, S. L. (Science, 229:1202-1207 (1985)) and by Oi et al., BioTechniques 4:214 (1986)).

Suitable "humanized" antibodies can be alternatively produced as described by Jones et al., Nature 321:552-525 (1986); Verhoeyan et al., Science 234:1534 (1988), and Beidler et al., J. Immunol. 141:4053-4060 (1988).

Antibodies against both highly conserved and poorly conserved regions of an APLP are useful for studies on the control of biosynthesis and catabolism of an APLP in normal and pathologic conditions. Further, these antibodies can be used clinically to monitor the progress of disease states wherein the expression of an APLP is aberrant.

The antibodies of the present invention can be utilized in immunoassays for the detection of an APLP wherever it may occur, including fluid, semi-fluid, or tissue samples, serum, or other biological specimen. The immunoassays can be competitive or sandwich, or as is otherwise well known and they all depend on the formation of antibody-antigen immune complex. These assays are well known to those of skill in the art.

For purposes of the assays, the antibodies can be immobilized or labeled. There are many carriers to which the antibodies can be bound for immobilization and which can be used in the present invention. Well-known carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, agaroses, and magnetite. The nature of the carrier can be either soluble to some extent or insoluble for purposes of the invention. Those skilled in the art will know many other suitable carriers for binding the antibodies, or will be able to ascertain such, using routine experimentation.

Depending on the particular embodiment of the invention, one or more of the antibodies will be coupled with a detectable label such as an enzyme, radioactive isotope, fluorescent compound, chemiluminescent compound, or bioluminescent compound.

Those of ordinary skill in the art will know of other suitable labels for binding to the antibodies or will be able to ascertain such using routine experimentation. Furthermore, the binding of these labels to the antibodies can be done using standard techniques commonly known to those of ordinary skill in the art.

The antibodies can be bound to an enzyme. This enzyme, in turn, when later exposed to its substrate will react to the substrate in such a manner as to produce a chemical moiety which can be detected, as, for example, spectrophotometric or fluorometric means. Examples of enzymes that can be used to detectably label are malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate isomerase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholinesterase.

The presence of an antibody can also be detected by labeling it with a radioactive isotope. The presence of the radioactive isotope could then be determined by such means as the use of a gamma counter or a scintillation counter. Isotopes which are particularly useful are ³ H, ¹²⁵ I, ³² P, ³⁵ S, 14C, ⁵¹ Cr, ³⁶ Cl, ⁵⁷ Co, ⁵⁸ Co, ⁵⁹ Fe, ⁷⁵ Se, and ¹⁵² Eu.

It is also possible to detect the presence of the antibody by labeling it with a fluorescent compound. When the fluorescently labeled antibody is exposed to light of the proper wavelength, its presence can then be detected due to fluorescence of the dye. Among the most important fluorescent labeling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde, and fluorescamine.

Another way in which the antibody can be detectably labeled is by coupling it to a chemiluminescent compound. The presence of the chemiluminescent-tagged antibody is then determined by detecting the presence of luminescence that arises during the course of a chemical reaction. Examples of particularly useful chemiluminescent labeling compounds are luminol, isoluminol, aromatic-acridinium ester, imidazole, acridinium salt, and oxalate ester.

Likewise, a bioluminescent compound may also be used to label the antibody. Bioluminescence is a special type of chemiluminescence which is found in biological systems and in which a catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a bioluminescent binding partner would be determined by detecting the presence of luminescence. Important bioluminescent compounds for purposes of labeling are luciferin, luciferase, and aequorin.

The antibodies for use in the assay of the invention are ideally suited for the preparation of a kit. Such a kit may comprise a carrier means being compartmentalized to receive in close confinement one or more container means such as vials, tubes, and the like, each of said container means comprising one of the separate elements to be used in the method.

For example, one of the container means may comprise a first antibody bound to an insoluble or partly soluble carrier. A second container may comprise soluble, detectably-labeled second antibody, in lyophilized form or in solution. The carrier means may also contain a third container means comprising a detectably-labeled third antibody in lyophilized form or in solution. Such a kit can be used for sandwich assays. See, e.g., David et al., U.S. Pat. No. 4,376,110 herein incorporated by reference.

In addition, the carrier means may also contain a plurality of containers each of which comprises different, predetermined amounts of known APLP. These latter containers can then be used to prepare a standard curve into which can be interpolated the results obtained from the sample containing the unknown amount of an APLP antigen.

Imaging can be carried out in vitro or in vivo. In vitro imaging can be done with the labels mentioned previously. In vivo imaging is done with diagnostically effective labeled antibodies. The term "diagnostically effective" means that the amount of detectably labeled antibody administered is sufficient to enable detection of the site of APLP presence when compared to a background signal.

Generally, the dosage of detectably labeled antibody for diagnosis will vary depending on considerations such as age, condition, sex, and extent of disease in the patient, counterindications, if any, and other variables, to be adjusted by the individual physician. Dosage can very from 0.01 mg/kg to 2,000 mg/kg, preferably 0.1 mg/kg to 1,000 mg/kg.

The term "diagnostically labeled" means that the immunoglobulin has attached to it a diagnostically detectable label.

There are many different imaging labels and methods of labeling known to those of ordinary skill in the art. Examples of the types of labels which can be used in the present invention include radioactive isotopes and paramagnetic isotopes.

For diagnostic in vivo imaging, the type of detection instrument available is a major factor in selecting a given radionuclide. The radionuclide chosen must have a type of decay which is detectable for a given type of instrument. In general, any conventional method for visualizing diagnostic imaging can be utilized in accordance with this invention.

Another important factor in selecting a radionuclide for in vivo diagnosis is that the half-life of a radionuclide be long enough so that it is still detectable at the time of maximum uptake by the target, but short enough so that deleterious radiation upon the host is minimized. Ideally, a radionuclide used for in vivo imaging will lack a particulate emission, but produce a large number of photons in a 140-200 keV range, which may be readily detected by conventional gamma cameras.

For in vivo diagnosis, radionuclides may be bound to antibody either directly or indirectly by using an intermediary functional group. Intermediary functional groups which are often used to bind radioisotopes which exist as metallic ions to antibody are diethylenetriaminepentaacetic acid (DTPA) and ethylenediaminetetracetic acid (EDTA). Typical examples of metallic ions which can be bound to immunoglobulins are ^(99m) Tc, ¹²³ I, ¹¹¹ In, ¹³¹ I, ⁹⁷ Ru, ⁶⁷ Cu, ⁶⁷ Ga, ⁷² As, ⁸⁹ Zr, and ²⁰¹ T1.

The antibodies used in the method of the invention can also be labeled with paramagnetic isotopes for purposes of in vivo diagnosis. Elements which are particularly useful (as in magnetic resonance imaging (MRI) techniques) in this manner include ¹⁵⁷ Gd, ⁵⁵ Mn, ¹⁶² Dy, ⁵² Cr, and ⁵⁶ Fe.

Preparations of the imaging antibodies for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propyleneglycol, polyethyleneglycol, vegetable oil such as olive oil, and injectable organic esters such as ethyloleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media, parenteral vehicles including sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers, such as those based on Ringer's dextrose, and the like. Preservatives and other additives may also be present, such as, for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. See, generally, Remington's Pharmaceutical Science, 16th ed., Mac Eds. 1980.

EXPERIMENTAL Example 1

Materials and Methods

Neuroblastoma NB2A cells were maintained as previously described (Magendantz et al., Proc. Natl. Acad. Sci. USA 82:6581-6585 (1985)). Radionucleotides were obtained from New England Nuclear and Amersham. Restriction enzymes were obtained from New England Biolabs and PCR reagents from Perkin-Elmer.

Screening of λgt11 Libraries. General techniques for preparing and screening libraries are disclosed in Maniatis et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 2d ed. (1989). Three different libraries were used to obtain mouse brain cDNA clones in λgt11. A random primed and an oligo-dT primed library were obtained from Clontech. An oligo-dT primed library was obtained from Stratagene. Libraries were screened by hybridization to nitrocellulose (BA85, Schleicher and Schuell) or Nylon (Hybond-N, Amersham) according to standard procedures using cDNA that was labeled by random priming (Feinberg et al., Anal. Biochem. 132:6-13 (1983)). Positive clones were sized by PCR amplification of the λgt11 insert using primers 1218 and 1222 from New England Biolabs.

Recombinant DNA Techniques

DNA fragments were subcloned into pBluescript (Stratagene) or M13 (New England Biolabs) vectors and both strands were sequenced with Sequenase (U.S. Biochemical) according to the manufacturer's instructions. Sequence analyses were done using the UWGCG programs at the Whitaker College Computing Facility at MIT.

RACE Procedure for Obtaining 5' cDNA Extensions

The RACE (Rapid Amplification of cDNA Ends) procedure that was used is a combination of the methods of Frohman et al. (Frohman et al., Proc. Natl. Acad. Sci. USA 85:8998-9002 (1988)) and Ohara et al. (Ohara et al., Proc. Natl. Acad. Sci. USA 86:5673-5677 (1989)). For the RACE procedure, the primers were the complements of nucleotides 699-719 and 672-692 of the sequence presented in FIGS. 2A-2E. RACE products were subcloned into pBluescript, screened by hybridization to the 5' 120 bp EcoRI-PstI fragment of 69A and positive clones were sequenced.

RNA Analysis

PolyA+ RNA was prepared as in Badley et al. (Badley et al., BioTechniques 6:114-116 (1988)), using oligo-dT beads (Collaborative Research). For Northern blot analysis the RNA was separated on an agarose gel containing formaldehyde, transferred to nylon (BioTrace, Gelman Sciences) according to standard methods (Sambrook et al., Molecular Cloning (A Laboratory Manual), Cold Spring Harbor Laboratory, (1989)), and crosslinked to the nylon using a UV Crosslinker (Stratagene). The blots were hybridized and washed according to the method of Church and Gilbert (Church et al., J. Cell Biol. 107:1765-1772 (1988)). The molecular weight of the transcripts was determined by using RNA molecular weight markers.

Production Antisera to an APLP Peptide.

A peptide with the sequence QQLRELQRH (SEQ ID NO: 1) was obtained from the Biopolymers laboratory of the Howard Hughes Medical Institute and Center for Cancer Research at MIT. 20 mg of the peptide was conjugated to KLH essentially as described in Marcantonio, C. G., and Hynes, R. O., J. Cell Biol. 107:1765-1772 (1988), and the immunization of four New Band white rabbits was carried out as described in Schatz et al. (Schatz et al., Mol. Cell Biol. 7:3799-3805 (1987)).

Protein Preparation

Protein from neuroblastoma cells was isolated by rinsing the cells with PBS followed by lysing the cells in SDS sample buffer and boiling. Protein from mouse brain was isolated by homogenizing one brain in 1 ml of RIPA buffer (50 nM Tris pH 7.4, 150 mM NaCl, 5 mM EDTA, 1% Triton X-100, 1% Na deoxycholate, 0.1% SDS and protease inhibitors). The homogenate was spun in an Eppendorf centrifuge for 30 minutes at 4° C., combined with SDS sample buffer and boiled.

β-galactosidase Fusion Protein Preparation

A β-galactosidase-APLP1 fusion protein was constructed using standard techniques (Sambrook et al., Molecular Cloning (A Laboratory Manual), Cold Spring Harbor Laboratory, (1989)). An EcoRI-EcoRI fragment from a λgt11 clone containing 666 nucleotides of the 3' coding portion (nucleotides 1380-2046 of FIGS. 2A-2E), and 258 nucfeotides of the untranslated region of the APLP (nucleotides 2047-2305 of FIGS. 2A-2E.) was ligated into the EcoRI site of a pUEX5 vector.

To prepare total protein lysates from bacteria containing these plasmids, an overnight culture, diluted 1:10 into L-broth plus ampicillin, was grown for 1.5-2 hours at 300, then induced at 42° (or left at 30° for uninduced samples) for 2.5-3 hours. The bacteria were then spun down, resuspended in 50% SDS sample buffer containing protease inhibitors and sonicated to shear chromosomal DNA.

Western Blot Analysis

Protein samples were subjected to polyacrylamide gel electrophoresis, transferred to nitrocellulose and probed with rabbit antibodies and ¹²⁵ I-labelled protein A essentially as described in Birgbauer (Birgbauer et al., J. Cell Biol. 109:1609-1620 (1989)).

Immunofluorescence

Neuroblastoma cells were plated onto glass coverslips approximately forty eight hours before fixation. Twenty four hours before fixation, the concentration of fetal calf serum in the medium of neuroblastoma cells was changed from 10% to 0.1% to induce neurite extension. Twenty minutes before fixation, concanavalin A was added to 20 mg/ml to encourage cell adhesion to the coverslips. Cells were fixed in 3.7% formaldehyde/PBS, permeabilized in acetone and blocked for 30 minutes at 37° in PBS containing 1% calf serum. Primary antibody was diluted into the blocking buffer, applied to the cells for 30 minutes and visualized with FITC conjugated goat anti-rabbit antibody. Cells were observed and photographed using a Zeiss Axioplan microscope. For the peptide competition experiment, the peptide was preincubated with the primary antibody in blocking buffer for 30 minutes before adding it to the cells (Donaldson et al., J. Cell Biol. 111:2295-2306 (1990) and Moremen et al., J. Biol. Chem. 260:6654-6662 (1985)).

Results

Identification and Cloning of APLP1

In a screen for cDNA clones encoding a microtubule-associated protein (MAP), a clone was isolated from a mouse brain cDNA library (Stratagene) which was found to have an open reading frame (ORF) homologous to that of APP. The probe that was used to screen was an antibody elicited against MAP. APLP1 is not related to any known MAP.

The cDNA clone in which the APP homology was originally identified contained a portion of the C-terminal coding sequence as well as a portion of the 3' untranslated region. To extend the APLP1 ORF in the 5' direction, probes were used from the 5'-most regions of available cDNA clones to screen two Clontech λgt11 libraries. Repetitive screens using progressively more upstream probes resulted in the isolation of a 1.8 kb cDNA clone, 69A (FIG. 1), whose 5' terminus has an EcoRI site that is present in the coding sequence of APLP1 and is the result of an EcoRI site that escaped methylation during construction of the cDNA library.

Although screening of the cDNA libraries with probes derived from the 5' end of 69A failed to identify any more APLP1 clones, the use of a variation of the RACE procedure developed by Frohman et al. (Frohman et al., Proc. Natl. Acad. Sci. USA 85:8998-9002 (1988)) and by Ohara et al. (Ohara et al., Proc. Natl. Acad. Sci. USA 86:5673-5677 (1989)) did enable the isolation of several independent, overlapping cDNA clones that extend the APP homology past the 5' EcoRI site of 69A. The longest clone, J (FIG. 1), did not contain an initiator methionine.

The sequence information obtained via the RACE procedure was used to create PCR primers and amplify the 5'-most 100 base pairs encoded by clone J. This PCR product was used as a probe in a screen of a Stratagene mouse brain cDNA library that successfully identified a number of full length APLP1 clones. Two of these were sequenced to obtain the final 313 5' nucleotides as well as the polyadenylation signal and the poly A tail of the APLP1 cDNA. The predicted initiator methionine is in agreement with the eukaryotic consensus initiation sequence (Kozak, M., Nucl. Acids Res. 12:857-872 (1984).

APLP1 is Related to APP

The 2361 nucleotides of the cDNA sequence encode an open reading frame of 653 amino acids as is shown in FIGS. 2A-2E (SEQ ID NO:3). The protein is predicted to have a short intracellular C-terminus of 46 amino acids, a membrane spanning domain of 23 amino acids, and a larger extracellular N-terminus. The predicted amino acid sequence and the overall structure of APLP1 is similar to those of APP, which resembles an integral membrane protein (Kang et al., Nature 325:733-736 (1987)). The alignment of the two amino acid sequences shown in FIGS. 3A-3B reveals that overall APLP is 42% identical and 64% similar to APP.

APLP1 is a Member of a Family of APP-like Proteins

The identities between APLP1 and APP are concentrated in three distinct regions (FIGS. 4A-4B), where the proteins are 47, 54 and 56% identical and 67, 73 and 74% similar. These same three regions have been shown previously to be shared between APP and a Drosophila APP-like protein (Drosophila APPL), and have been termed the extracellular I (EI), extracellular II (EII) and cytoplasmic (C) domains by these investigators (Rosen et al., Proc. Natl. Acad. Sci. USA 86:2478-2482 (1989)). The cytoplasmic domain homology is also present in a partial cDNA clone that has been isolated from a rat testis library (Yan et al., Proc. Natl. Acad. Sci. USA 87:2405-2408 (1990)). Only APP contains the βA4 sequence that is found in amyloid plaques.

FIGS. 4A-4B shows the domain alignment of the four proteins mentioned above. A similar alignment has been shown for the relationship between the Drosophila APPL1 and APP (Rosen et al., Proc. Natl. Acad. Sci. USA 86:2478-2482 (1989)). The testis cDNA is included only in the C domain comparison since only this portion of the predicted amino acid sequence is known (Yan et al., Proc. Natl. Acad. Sci. USA 87:2405-2408 (1990)). EI begins at amino acid 21 in the APLP1 open reading frame and spans 136 amino acids. Overall, 102 of these 136 amino acids (75%) are either identical to amino acids in the respective positions of APP or Drosophila APPL, or they are the same in all three proteins. The most striking conservation within this region is that of 12 cysteine residues in all three of the sequences. There are also two regions of amino acids that are particularly well conserved (underlined in FIGS. 4A-4B), as is an unusually acidic region composed of glutamic and/or aspartic acids that spans amino acids 237-271 in the APLP1 sequence (FIGS. 2A-2E).

EII spans 130 amino acids in the mouse APLP1 sequence. 93 of the 130 APLP1 amino acids (71%) are identical to either one or both of their counterparts in APP or the Drosophila APPL sequences. This region also contains conserved N-glycosylation site in all 3 proteins.

The third domain encompasses the C-terminal cytoplasmic region of all of the proteins, including the predicted amino acid sequence of the rat testis cDNA. The conservation of amino acids among the members of the APP-like family within this domain is particularly strong. Although the four proteins do not share homology within the predicted transmembrane domains (FIGS. 3A-3E; Rosen et al., Proc. Natl. Acad. Sci. USA 86:2478-2482 (1989); Yan et al., Proc. Natl. Acad. Sci. USA 87:2405-2408 (1990)), all of them do contain a 3-4 amino acid span of charged residues (arginine/lysine) at the cytoplasmic face of the membrane (FIGS. 4A-4B). This characteristic is often seen at the membrane-cytoplasmic junction of other proteins, and has been hypothesized to allow for an interaction with phospholipids in the membrane, or to provide a stop transfer signal for membrane bound proteins (Blobel, G., Proc. Natl. Acad. Sci. USA 77:1796-1500 (1980)).

Northern Blot Analysis

FIG. 5 shows autoradiographs of Northern blots containing polyA+ RNA from mouse brain and neuroblastoma cells that were probed with DNA corresponding to nucleotides 1791-2305 of FIGS. 2A-2E. These blots reveal that in mouse brain and neuroblastoma cells there are two messages of approximately 2.4 and 1.6 kb that hybridize to this probe. The larger message appears to be present in relatively greater abundance than the smaller message. Because of its size, it is clear that the cDNA that corresponds to the 2.4 kb message, although both messages are consistently seen in Northerns that are probed and washed under stringent conditions. The mouse APLP1 cDNA docs not hybridize to the 3.2 and 3.4 kb APP messages under the conditions used (see Materials and Methods; Kang, J. et al., Nature 325:733-736 (1987)).

Generation of Antibodies Against an APLP1 Peptide.

In order to further characterize the protein encoded by the APLP1 cDNA, antibodies were raised to a synthetic peptide which corresponds to a unique sequence of mouse APLP1. The peptide that was used as antigen corresponds to a 9 amino acid segment located near the C-terminus of the APLP1 protein (QQLRELQRH (SEQ ID NO: 1)), a region where the four proteins are not homologous.

Four rabbits were injected with the peptide as described in Materials and Methods. Two of the four rabbits (301 and 302) produced sera that strongly recognize a 65 kDa mouse brain protein that is not recognized by the appropriate preimmune sera (FIG. 6A). A smaller protein of approximately 33 kDa that is recognized by antiserum 301 may be a proteolytic degradation product of the larger protein. In FIG. 6A, the specificity of the interaction of the antibody with these proteins is demonstrated by the ability to block the binding of antibody 301 to the proteins by preabsorbing with the original peptide (lanes 2-5); an irrelevant peptide has no effect on the interaction of the antibody with either the 65 kDa or 33 kDa protein (lane 6). Antiserum 301 also recognizes a 65 kDa protein present in neuroblastoma cell extracts that is not recognized by preimmune serum (FIG. 6B).

To further confirm the specificity of the 301 antiserum, we determined whether the antiserum would recognize a β-galactosidase fusion protein containing the 222 carboxy-terminal amino acids encoded by the APLP1 cDNA. FIG. 6C shows a Western blot of bacterially produced proteins that were probed with antiserum 301. As can be seen in lane 6 of this figure, antisera 301 does specifically interact with the β-galactidase-APLP1 fusion protein.

There are a number of antibodies that have been generated against the C-terminus of APP. Because the identity between APP and the mouse APLP1 in this region is particularly strong, some of these antisera would also be likely to interact with the mouse APLP1. One of these antisera, R37 (Kang et al., Nature 325:733-736 (1987); and Ishii et al., Neuropatolo. and Appl. Neurobiol. 15:135-147 (1989)), is directed against the carboxy-terminal 15 amino acids of APP, a region where the two proteins are particularly similar (see FIGS. 4A-4B). R37 does recognize the β-galactosidase-APLP1 fusion protein and a 65 kDa mouse brain protein that comigrates with the 65 kDa protein recognized by antiserum 301 (data not shown). The 15 amino acid sequence used to raise the anti-APP antibody does not overlap the 9 amino acids used to generate antiserum 301. These data suggest that the 65 kDa protein contains two epitopes in common with the APLP1 fusion protein. Antibodies can be made which recognize only APLP1.

Anti-APLP1 Antisera Recognizes a Protein in the Golgi.

The subcellular localization of the protein recognized by antiserum 301 was assayed by immunofluorescence. When neuroblastoma cells are stained with 301, the pattern that is observed is a reticular staining near the nucleus (FIGS. 7A and 7B). Because of 3-dimensional nature of the staining, and the round shape of the cells, the image seen in any one plane of focus appears punctate rather than reticular. An identical pattern is seen with antiserum 302 (data not shown). The pattern itself is reminiscent of Golgi staining, and when these cells are stained with an antibody to a known Golgi enzyme, mannosidase II, a pattern much like that seen with antiserum 301 is observed (FIG. 7C). The inclusion of the original peptide in the antibody incubation inhibited the 301 staining (FIG. 7D). Staining was not seen when preimmune serum was used (FIG. 7E).

Discussion

The present APLP1 cDNA sequence encodes a new member of the APP-like family. The mouse homologue of the human amyloid precursor protein has been cloned previously and is 96.8% identical to the human sequence at the amino acid level (Yamada et al., Biochem. Biophys. Res. Comm. 158:906-912 (1987)). Thus, the present APLP1 cDNA, which is 42% identical to the amyloid precursor protein at the amino acid level, is not the mouse homologue of APP and is a distinct, yet related protein.

The two sequences share three domains of homology. The amino acid conservation within these domains include 12 cysteines, an unusually acidic region, a potential N-glycosylation site, a hydrophobic membrane spanning region, and several specific blocks of exact identity. It is clear that the mouse APLP, the Drosophila APPL, the rat testis protein and APP comprise a family of proteins. The extensive conservation of amino acid identity as well as both the overall and specific domain structure within this family of proteins suggests that these proteins share a common function.

There are two potentially interesting observations that can be made concerning the strict conservation of the 7-amino acid sequence located within the cytoplasmic tail of the proteins in the APP-like family (see underlined sequence in the appropriate portion of FIGS. 4A-4B). There is a potential tyrosine phosphorylation site present 8-9 amino acids from the carboxy terminus of all four sequences (Tamkun et al., Cell 46:271-282 (1986)). APP can be phosphorylated when introduced into transformed embryonic kidney cells (Oltersdorf et al., J. Biol. Chem. 265:4492-4497 (1990)) and a peptide containing a portion of the cytoplasmic domain can be phosphorylated on serine and threonine residues in vitro (Gandy et al., Proc. Natl. Acad. Sci. USA 85:6218-6221 (1988)), but tyrosine phosphorylation has not yet been demonstrated. Agents that are known to regulate protein phosphorylation appear to affect the rate of proteolytic processing of mature forms of APP (Buxbaum et al., Proc. Natl. Acad. Sci. USA 87:6003-6006 (1990)), suggesting that abnormal protein phosphorylation may be involved in the production of βA4. The sequence surrounding this tyrosine also shares homology with the only tyrosine in the α-helical domain that is conserved between several classes of intermediate filaments (Lendahl et al., Cell 60:585-595 (1990)). The conservation of this potentially phosphorylated tyrosine is intriguing in light of the role that tyrosine phosphorylation is known to play in the regulation of cell growth and differentiation.

The same tyrosine is part of the tetrameric sequence NPxY that is believed to be required for the ligand-independent, coated pit-mediated internalization of the low density lipoprotein receptor (Chen, et al., J. Biol. Chem. 265:3116-3123 (1990)). The NPxY sequence is present in the cytoplasmic tails of at least 16 other cell surface receptor molecules--including the β-integrin receptor and members of the EGF receptor family (Chen, et al., J. Biol. Chem. 265:3116-3123 (1990)).

The APLP1 cDNA that has been isolated shares at least one epitope with a 65 kDa protein that is present in mouse brain homogenates and neuroblastoma cell extracts, and it shares an epitope with a protein that localizes to the Golgi in neuroblastoma cells. An antiserum that recognizes the Drosophila APPL protein also recognizes a protein in the Golgi (Luo et al., J. Neurosci 10:3849-3861 (1990)). In addition, antibodies to the APP give a perinuclear staining pattern suggestive of either Golgi or ER localization when used for immunofluorescence on muscle fibers (Zimmermann et al., EMBO J. 7:367-372 (1988)). The N-terminal extracellular portion of both the Drosophila APPL protein and APP can be secreted via cleavage at or near the membrane (Weidman et al., Cell 57:115-126 (1989); Zimmermann et al., EMBO J. 7:367-372 (1988); and Palmert et al., Proc. Natl. Acad. Sci. USA 86:6338-6342 (1989)). Although the normal function of the proteins in the APP family remains cryptic, the present results suggest that APLP1, like APP and the Drosophila APPL may be processed in, or reside in, the Golgi.

The existence of a family of APP-like proteins implies that these proteins may share a function. The conservation of cysteines at the N-termini is indicative of conserved tertiary and/or quaternary structure, and suggests that these molecules may interact with a common extracellular molecule. Likewise the strong amino acid conservation within the intracellular C-termini suggests that the proteins in this family may interact with a common molecule inside of the cell. A distinct physiological role for APP has yet to be determined. Clues to the function of any of the members of the APP-like family of proteins should help to elucidate the normal function and processing and regulation of APP.

Example 2

Mapping the Human Chromosomal Locus Encoding APLP1

Portions of the mouse brain cDNA and a 1.8 kb partial cDNA isolated from a human brain cDNA library were used to map the human chromosomal locus encoding APLP1. To determine the best restriction digest for the selective identification of the human chromosomal APLP1 fragments, human, mouse and hamster genomic DNAs were analyzed by Southern blot hybridization using the partial human cDNA clone following digestion with EcoRI, HindIII, PstI, and TaqI. EcoRI was chosen for further analysis since it produced human DNA fragments (approximately 8 kb and 3.3 kb, data not shown) that were clearly discernable from the rodent. A panel of DNAs from 31 human-rodent somatic cell lines (Geissler et al., Somat. Cell Mol. Genet. 17:197-214 (1991)) of known karyotype was digested with EcoRI. These DNAs were then probed with the human APLP1 cDNA clone and the hybridization pattern was consistent with the assignment of the APLP1 locus to chromosome 19.

To determine the regional position of APLP1 locus on chromosome 19, the full-length mouse brain APLP1 cDNA was hybridized to EcoRI-digested genomic DNA from a number of somatic cell hybrids containing only human chromosome 19 or specific fragments of this autosome as well as other chromosomes (G35CCB, G35F3B, GM989A99c7B, G24B2AM, FON1A4, TVB1D, 1016A and 5HL94; FIG. 8). All of these hybrid lines, with the exception of GM89A99c7B, contain the two human specific APLP1 bands. GM89A99c7B contains the reciprocal part of the X: 19 translocation occurring in 908K1, G35F3 and G35FCC (FIG. 8). These results exclude the APLP1 locus from the short arm of chromosome 19 and place it between 19q13.2 and the centromere.

Discussion

While no physiological role has been determined for APLP1, its map location is interesting in view of its potential relationship to Alzheimer's disease (AD). The chief component of Alzheimer-associated amyloid is the 39-43 amino acid βA4 peptide which is derived from the larger amyloid precursor protein (APP) encoded by a gene on chromosome 21 (Kang et al., Nature 325:733-736 (1987); Robakis et al., Proc. Natl. Acad. Sci. USA 84:4190-4194 (1987); Tanzi et al., Science 235:880-884 (1987)). The gene defect for an early-onset (>65 years of age) form of familial Alzheimer's disease (FAD) has been mapped to chromosome 21 (St. George-Hyslop et al., Science 235:885-889 (1987)) and a small percentage (<3%) of FAD appears to be caused by mutations within the APP gene (Chartier-Harlin et al., Nature 353:884-846 (1991); Goate et al., Nature 349:704-706 (1991); Murrell et al., Science 254:97-99 (1991)). Genetic heterogeneity has also been reported for FAD (St. George-Hyslop et al., Nature 347: 194-197 (1990)) and a set of late-onset (>65 years of age) FAD pedigrees have recently demonstrated linkage to chromosome 19 (Pericak-Vance et al., Am. J. Hum. Genet. 48:1034-1050 (1991)). Because of the regional chromosomal localization of APLP1 to the proximal portion of 19q and the significant homology of this gene to APP, APLP1 is a candidate for the gene defect responsible for a late-onset form of FAD.

Example 3

Isolation and characterization of the Human APLP2 Gene encoding a homologue of the Alzheimer's Associated Amyloid B protein precursor

In an attempt to isolate other members of the APP protein family, the mouse APLP1 sequence was first used to scan the Genbank database for homologous sequences. In addition to obtaining matches for APP, APPL, and the partial cDNA from rate testes, a match with an anonymous 274 base pair human brain cDNA entry (Genbank accession number M78104), was noted. This match, which was significant but not identical to mouse APLP1 (63% identity), indicated that M78104 was a small piece of a cDNA encoding a second APLP. In order to characterize the APP-like gene family in more detail, full length cDNAs for this second APLP, APLP2, were isolated. The isolation and characterization of APLP2 cDNA clones from human brain, provide further support for the hypothesis that APP is a member of a highly conserved gene family.

A human brain frontal cortex Lambda Zap II cDNA library (Stratagene) was screened with a probe consisting of a PCR product generated with primers designed to amplify a portion of the 274 base pair partial cDNA sequence identified in Genbank. To prepare the probe, a primer set (5'GCAACCGAATGGACAGGGTA 3' (SEQ ID NO:16) and 5'CAAGGCAGCCA GGTAGTTCTC 3' (SEQ ID NO:17); see FIGS. 9A-9B) was used to amplify a 232 base pair product from a human occipital cortex cDNA library. The PCR product was sequenced to confirm its identity and an internal primer set (5'GTAAAGAAGGAATGGGAAGAGGC3 ' (SEQ ID NO:18), and 5'CCATCCGACGGC GGTCATTCAGC3' (SEQ ID NO:19); see FIGS. 9A-9B) was designed and used to amplify a 185 base pair PCR fragment (SG190) that was used for the human brain library screen. Screening, purification and sequencing of the SG190-positive clones, including a full length cDNA were carried out according to standard conditions (Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992)).

Human APLP2 is encoded by a 706 amino acid sequence (SEQ ID NO:4) that is similar to APP and APLP1 in overall structure as well as amino acid sequence. APLP2 is 52% identical, 69% similar to APP695 (FIGS. 9A-9B) and 43% identical, 63% similar to APLP1. Virtually all of the identified domains and motifs that characterize APP, APPL and APLP1 are present in APLP2. Specifically, an N-terminal cysteine-rich region (consisting of 12 cysteines), a novel zinc-binding motif (Bush et al., Neurobiol. Aging 13 (supplement 1):A.331 (1992)), an acidic-rich domain, N-glycosylation sites, a hydrophobic membrane spanning domain and a cytoplasmic domain containing a clathrin binding motif and potential serine/threonine, casine kinase I, II and tyrosine phosphorylation sites are conserved in APLP2 (FIGS. 9A-9B). FIGS. 10A-10B shows the amino acids that are identical or conservatively substituted in APLP2, APP and APLP1 demonstrating the extremely high degree of conservation among these proteins. Some of these stretches of homologous amino acids shown in FIGS. 10A-10B may contain potential consensus motifs that are germane to the function of this protein family.

Chromosomal Location of APLP2 Gene

To determine the chromosomal location of the APLP2 gene, a cDNA probe was hybridized to a filter containing Hind III-digested DNA from a panel of 43 human-rodent somatic cell hybrid lines containing either individual or specific sets of human chromosomes (Pelletier et al., Genomics 10:1079-1082 (1991); Geissler et al., Som Cell Gen 17:207-214 (1991)). The probe detected specific human APLP2 bands in somatic cell hybrid lines consistent with the assignment of the APLP2 gene locus to human chromosome 11. Specifically, a positive signal for human APLP2 was obtained in a somatic cell hybrid line containing DNA from chromosome 11 as its only human material, and in a hybrid containing chromosome 11 and three other human chromosomes.

During the sequencing of the human APLP2 cDNA clones, a single alternatively spliced form containing an exon encoding a 12 amino acid stretch was identified indicating that like APP, APLP2 is alternatively transcribed (FIGS. 9A-9B). Although a portion of the APLP2 cDNA isolated from mouse embryo contained a KPI domain similar to that in APP, a form of adult human APLP2 that contains such a domain has not yet been detected.

Northern Blot Analysis

Northern blot analysis of fetal peripheral tissue and adult brain regions demonstrated that the APLP2 message is approximately 4 kb in size (FIGS. 11A-11B). Lighter bands at approximately 3 kb and 2 kb may represent cross-hybridizing messages from other members of the APP/APLP family, or as of yet unisolated APLP2 alternative transcripts. The APLP2 transcript was detected at varying levels in all peripheral and central nervous system tissues tested and displayed a level and pattern of expression that is extremely similar to that of APP (FIG. 11A). Both transcripts are expressed in relatively abundant amounts in brain, heart, and kidney and at lower levels in liver and thymus. However, in contrast to APP, APLP2 is expressed at relatively high levels in the small intestine and lung.

To determine the distribution of APLP2 transcript in the adult human brain Northern blot analysis was carried out on mRNA from 11 different brain regions (FIG. 11B). This same blot had been previously hybridized to APP thus allowing a direct comparison of expression of the two genes (Tanzi et al., Science 235:880-884 (1987); Tanzi et al., Nature 331:528-530 (1988)). The levels of APLP2 mRNA were highest in the temporal association cortex (A20, Tanzi et al., Nature 331:528-530 (1988)), the posterior perisylvian cortex-supramarginal gyri (A40), the anterior perisylvian cortex-opercular gyri (A44) and frontal pole of the cortex (A10). These regions which are particularly affected in the brains of AD patients, normally contain a relatively large amount of APP RNA. Moderate hybridization was detected in the cerebellar cortex and the caudate-putamen. Relatively weaker hybridization was seen in the striate, extrastriate, and motor cortices (A17, A18 and A4), the hippocampus, and the thalamus. Overall, APLP2 reveals a pattern of expression that is very similar to that of APP (FIG. 11B; Tanzi et al., Science 235:880-884 (1987); Tanzi et al., Nature 331:528-530 (1988)), although, some differences were noted. For example, APLP2 is expressed at relatively higher levels than APP in thalamus, while APP expression is greater than that of APLP2 in Brodman area A40.

FIG. 12 shows the result of Northern blot hybridization of APLP2 and APP cDNA probes to RNA derived from normal and fetal brains with Down syndrome (DS), and from normal and adult brains with AD. Although APP expression is higher in the DS samples, APLP2 expression is not significantly changed. This result is not unexpected given the extra copy of chromosome 21 present in DS patients. APP expression is slightly lower in AD versus normal adult cerebellum, and is dramatically decreased in AD frontal cortex relative to normal (FIG. 12). This decrease in APP expression is probably a reflection of AD-related neuronal loss in these areas which is particularly enhanced in the frontal cortex. Surprisingly, it has presently been found that although APP expression is somewhat decreased in AD cerebellum compound to normal, APLP2 expression is clearly increased in this same AD cerebellum sample (FIG. 12). One possibility is that this may reflect a compensatory increase in APLP2 expression in response to lower levels of APP. It is equally conceivable that increased expression of APLP2 preceded the decrease in APP message.

The present inventors have discovered that APLPs may compete with APP for factors involved with maturation and processing (Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992)). This would require that the two proteins are produced and processed within the same cell populations. To address this issue, non isotopic in situ hybridization studies designed to localize APLP2 mRNA transcripts within the hippocampal formation, a region that is severely affected in AD, have been employed. It was found that the mRNA for APLP2 is contained in both the cell soma and, to some extent, neuronal processes of pyramidal neurons in the hippocampal formation (FIGS. 13A-13B). Much less hybridization was observed in smaller interneurons, glial cells, and endothelial cells. The subcellular localization is similar to that seen for APP and APLP1 messages using the same in situ hybridization procedure (Tanzi et al., Mol. Brain Res.:in press; Hyman et al., Mol. Brain Res.:in press; Wasco et al., Alzheimer's disease and related disorders 1992: selected communications (in press)). Moreover, the cellular specificity and regional distribution of the APLP messages are also extremely similar to those of APP indicating that the APP and APLPs are located within the same sets of neurons in the hippocampal formation.

Based on the overall conservation of amino acid sequence and domain structure within the APP gene family, these proteins may share common functions and, perhaps be processed similarly (Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992)). Recent data further indicate that APLP2 and APP undergo similar processing (unpublished data). Antibodies to APP, APLP1 and APPL recognize proteins in the Golgi (Wasco et al., Proc. Natl. Acad. Sci. USA 89:10758-10762 (1992); Zimmermann et al., EMBO J. 7:367-372 (1988); Palacios et al., Mol. Brain Res. 15:195-206 (1992); Luo et al., J. Neurosci. 10:3849-3861 (1990)). Likewise, APLP2 appears to be associated with the Golgi apparatus unpublished data. This suggests that maturation of these proteins in the Golgi very likely involves interaction with common factors. The apparent similarities in the processing and maturation of APP and APLP2 raises the possibility that altered expression of APLP2 or other APLPs could affect the post-translational modification and metabolism of APP in cells where these genes are co-expressed. If APLP2 or other APLPs were to interfere with the proper maturation (e.g. N- or O-glycosylation) of APP, APP could be rerouted into alternative pathways including those predisposed to amyloid formation. Along these same lines, if the metabolic machinery responsible for processing APP were overburdened with members of the APLP family, altered metabolism of APP may occur, perhaps resulting in increased production of amyloidogenic fragments. Therefore, although the APLP2 and APLP1 do not contain an Aβ domain, they may still ultimately affect the maturation and/or metabolism of APP.

All publications mentioned in this specification are indicative of the level of skill of those skilled in the art to which this invention pertains. All publications are herein incorporated by reference, to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference.

While the invention has been described in connection with the specific embodiments thereof, it will be understood that it is capable of further modifications and this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features hereinbefore set forth and as follows in the scope of the appended claims.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 19                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 9 amino acids                                                      (B) TYPE: amino acid                                                           (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GlnGlnLeuArgGluLeuGlnArgHis                                                    159                                                                            (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2358 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA                                                        (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 88..2046                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        CGGCACGAGGTGGCGCTGGGAGCTCCTGTCACCGCTGGGGCCGGGTAGGGGCGGGCGGGA60                 GCGCAGGGACGTGAGGGCCGAGCGGACATGGGGCCCACCAGCCCCGCCGCT111                         MetGlyProThrSerProAlaAla                                                       15                                                                             CGCGGTCAGGGTCGCCGCTGGCGACCGCCGCTGCCGCTGTTGCTGCCA159                            ArgGlyGlnGlyArgArgTrpArgProProLeuProLeuLeuLeuPro                               101520                                                                         CTGTCATTGCTGCTTCTGCGCGCGCAGCTCGCCGTCGGGAACCTGGCT207                            LeuSerLeuLeuLeuLeuArgAlaGlnLeuAlaValGlyAsnLeuAla                               25303540                                                                       GTTGGGAGCCCCAGCGCGGCCGAGGCTCCGGGGTCGGCTCAAGTGGCT255                            ValGlySerProSerAlaAlaGluAlaProGlySerAlaGlnValAla                               455055                                                                         GGACTATGTGGGCGTCTAACCCTTCACCGGGACTTGCGCACCGGCCGC303                            GlyLeuCysGlyArgLeuThrLeuHisArgAspLeuArgThrGlyArg                               606570                                                                         TGGGAACCAGACCCACAGCGATCACGACGCTGTCTTCTGGACCCGCAG351                            TrpGluProAspProGlnArgSerArgArgCysLeuLeuAspProGln                               758085                                                                         CGCGTGCTGGAGTACTGCAGACAGATGTACCCCGAGCTGCACATAGCA399                            ArgValLeuGluTyrCysArgGlnMetTyrProGluLeuHisIleAla                               9095100                                                                        CGCGTGGAGCAGGCTGCACAGGCCATCCCGATGGAGCGCTGGTGTGGG447                            ArgValGluGlnAlaAlaGlnAlaIleProMetGluArgTrpCysGly                               105110115120                                                                   GGTACCCGGAGTGGCAGATGCGCCCACCCCCACCATGAGGTTGTGCCC495                            GlyThrArgSerGlyArgCysAlaHisProHisHisGluValValPro                               125130135                                                                      TTCCATTGCCTGCCTGGCGAATTCGTGAGTGAAGCCCTGCTAGTGCCC543                            PheHisCysLeuProGlyGluPheValSerGluAlaLeuLeuValPro                               140145150                                                                      GAAGGCTGTCGGTTCTTGCACCAGGAGCGTATGGACCAGTGTGAGAGT591                            GluGlyCysArgPheLeuHisGlnGluArgMetAspGlnCysGluSer                               155160165                                                                      TCAACCAGGAGGCATCAGGAGGCTCAGGAGGCCTGCAGCTCTCAGGGC639                            SerThrArgArgHisGlnGluAlaGlnGluAlaCysSerSerGlnGly                               170175180                                                                      CTCATCCTGCACGGCTCTGGCATGCTTTTGCCCTGTGGCTCTGATCGG687                            LeuIleLeuHisGlySerGlyMetLeuLeuProCysGlySerAspArg                               185190195200                                                                   TTCCGAGGTGTGGAGTATGTATGCTGTCCACCTCCCGCAACTCCCAAC735                            PheArgGlyValGluTyrValCysCysProProProAlaThrProAsn                               205210215                                                                      CCATCTGGGATGGCAGCTGGTGACCCCTCTACCCGGTCCTGGCCCCTG783                            ProSerGlyMetAlaAlaGlyAspProSerThrArgSerTrpProLeu                               220225230                                                                      GGGGGCAGAGCAGAGGGAGGTGAGGATGAAGAGGAGGTGGAATCTTTC831                            GlyGlyArgAlaGluGlyGlyGluAspGluGluGluValGluSerPhe                               235240245                                                                      CCTCAGCCAGTAGACGATTACTTCGTAGAGCCCCCTCAGGCTGAAGAA879                            ProGlnProValAspAspTyrPheValGluProProGlnAlaGluGlu                               250255260                                                                      GAAGAGGAAGAGGAGGAAGAAAGGGCCCCACCTCCCAGCTCCCACACC927                            GluGluGluGluGluGluGluArgAlaProProProSerSerHisThr                               265270275280                                                                   CCTGTCATGGTTAGCAGAGTCACTCCCACCCCAAGGCCTACTGATGGT975                            ProValMetValSerArgValThrProThrProArgProThrAspGly                               285290295                                                                      GTGGATGTTTACTTTGGCATGCCTGGGGAAATCGGCGAGCATGAGGGT1023                           ValAspValTyrPheGlyMetProGlyGluIleGlyGluHisGluGly                               300305310                                                                      TTCCTGAGGGCCAAGATGGACCTGGAGGAGCGTAGGATGCGCCAGATT1071                           PheLeuArgAlaLysMetAspLeuGluGluArgArgMetArgGlnIle                               315320325                                                                      AATGAGGTGATGCGTGAATGGGCCATGGCTGACAGCCAATCTAAGAAC1119                           AsnGluValMetArgGluTrpAlaMetAlaAspSerGlnSerLysAsn                               330335340                                                                      CTGCCAAAGGCGGACAGGCAGGCCCTGAATGAGCACTTCCAGTCCATT1167                           LeuProLysAlaAspArgGlnAlaLeuAsnGluHisPheGlnSerIle                               345350355360                                                                   CTGCAGACCCTGGAAGAACAAGTGTCTGGTGAACGGCAACGCCTGGTG1215                           LeuGlnThrLeuGluGluGlnValSerGlyGluArgGlnArgLeuVal                               365370375                                                                      GAGACCCACGCCACCAGAGTCATCGCTCTGATCAACGACCAGCGCCGA1263                           GluThrHisAlaThrArgValIleAlaLeuIleAsnAspGlnArgArg                               380385390                                                                      GCAGCCCTGGAAGGTTTCCTGGCAGCCTTACAGGGCGATCCGCCTCAG1311                           AlaAlaLeuGluGlyPheLeuAlaAlaLeuGlnGlyAspProProGln                               395400405                                                                      GCTGAGCGAGTTCTGATGGCCCTGAGGCGCTACCTGCGCGCCGAGCAG1359                           AlaGluArgValLeuMetAlaLeuArgArgTyrLeuArgAlaGluGln                               410415420                                                                      AAAGAGCAGAGGCACACTCTGAGGCACTACCAGCACGTGGCCGCAGTG1407                           LysGluGlnArgHisThrLeuArgHisTyrGlnHisValAlaAlaVal                               425430435440                                                                   GATCCTGAGAAGGCCCAGCAGATGCGCTTTCAGGTCCAGACCCACCTT1455                           AspProGluLysAlaGlnGlnMetArgPheGlnValGlnThrHisLeu                               445450455                                                                      CAGGTGATCGAAGAGCGAATGAATCAGAGCCTGGGGCTGCTCGACCAG1503                           GlnValIleGluGluArgMetAsnGlnSerLeuGlyLeuLeuAspGln                               460465470                                                                      AACCCTCACCTGGCTCAGGAGCTGCGGCCACAGATCCAGGAGCTTCTC1551                           AsnProHisLeuAlaGlnGluLeuArgProGlnIleGlnGluLeuLeu                               475480485                                                                      CTTGCTGAACACTTGGGTCCCAGTGAACTGGACGCCTCTGTGCCCGGG1599                           LeuAlaGluHisLeuGlyProSerGluLeuAspAlaSerValProGly                               490495500                                                                      AGCAGCAGTGAGGACAAAGGTAGCCTCCAGCCTCCCGAATCCAAGGAC1647                           SerSerSerGluAspLysGlySerLeuGlnProProGluSerLysAsp                               505510515520                                                                   GATCCCCCAGTGACCCTTCCAAAAGGGTCCACAGATCAAGAGTCATCC1695                           AspProProValThrLeuProLysGlySerThrAspGlnGluSerSer                               525530535                                                                      TCCTCTGGGAGAGAGAAGCTAACTCCACTGGAGCAGTATGAGCAAAAG1743                           SerSerGlyArgGluLysLeuThrProLeuGluGlnTyrGluGlnLys                               540545550                                                                      GTGAATGCATCCGCCCCGAGGGGGTTTCCGTTCCACTCGTCAGATATC1791                           ValAsnAlaSerAlaProArgGlyPheProPheHisSerSerAspIle                               555560565                                                                      CAGCGGGATGAACTGGCTCCTTCCGGGACTGGAGTGTCCCGAGAGGCC1839                           GlnArgAspGluLeuAlaProSerGlyThrGlyValSerArgGluAla                               570575580                                                                      TTGTCAGGTCTGCTGATCATGGGAGCTGGAGGAGGCTCTCTCATTGTC1887                           LeuSerGlyLeuLeuIleMetGlyAlaGlyGlyGlySerLeuIleVal                               585590595600                                                                   CTATCCTTGCTGCTTCTGCGCAAGAAGAAACCCTATGGGACTATCAGC1935                           LeuSerLeuLeuLeuLeuArgLysLysLysProTyrGlyThrIleSer                               605610615                                                                      CATGGAGTGGTGGAGGTGGACCCCATGCTGACCCTGGAGGAGCAGCAG1983                           HisGlyValValGluValAspProMetLeuThrLeuGluGluGlnGln                               620625630                                                                      CTCCGGGAACTTCAGAGGCATGGCTATGAGAACCCCACCTACCGCTTC2031                           LeuArgGluLeuGlnArgHisGlyTyrGluAsnProThrTyrArgPhe                               635640645                                                                      CTGGAAGAACGACCTTGACCCCTACCCTAGCTGCCTTCAGCTGAGCCCTACTGCC2086                    LeuGluGluArgPro                                                                650                                                                            TTCTTCCGGCCCCCCAAACCCAACTCCCAGCTTCCGGTGGGGGAGGGAGATCTTGACAAA2146               TTCATTCTTGTTTCCCCTTCCTAGTTCCAAATTCCACACCCTTAGAAATCCCCAGCTCCT2206               GTCCCACAAGGGACCTCTTCACCTTAATTTATTTTACGTTAATTTATTGCTCCTTAAGGT2266               GACCTGGGTCCCAGGTATGTATGTCACTCCCTGGAATTCACCATCCCACGTTTCTTCACT2326               AACATCCCAATAAACTCCTCTTTCCCTCCGGC2358                                           (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 653 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        MetGlyProThrSerProAlaAlaArgGlyGlnGlyArgArgTrpArg                               151015                                                                         ProProLeuProLeuLeuLeuProLeuSerLeuLeuLeuLeuArgAla                               202530                                                                         GlnLeuAlaValGlyAsnLeuAlaValGlySerProSerAlaAlaGlu                               354045                                                                         AlaProGlySerAlaGlnValAlaGlyLeuCysGlyArgLeuThrLeu                               505560                                                                         HisArgAspLeuArgThrGlyArgTrpGluProAspProGlnArgSer                               65707580                                                                       ArgArgCysLeuLeuAspProGlnArgValLeuGluTyrCysArgGln                               859095                                                                         MetTyrProGluLeuHisIleAlaArgValGluGlnAlaAlaGlnAla                               100105110                                                                      IleProMetGluArgTrpCysGlyGlyThrArgSerGlyArgCysAla                               115120125                                                                      HisProHisHisGluValValProPheHisCysLeuProGlyGluPhe                               130135140                                                                      ValSerGluAlaLeuLeuValProGluGlyCysArgPheLeuHisGln                               145150155160                                                                   GluArgMetAspGlnCysGluSerSerThrArgArgHisGlnGluAla                               165170175                                                                      GlnGluAlaCysSerSerGlnGlyLeuIleLeuHisGlySerGlyMet                               180185190                                                                      LeuLeuProCysGlySerAspArgPheArgGlyValGluTyrValCys                               195200205                                                                      CysProProProAlaThrProAsnProSerGlyMetAlaAlaGlyAsp                               210215220                                                                      ProSerThrArgSerTrpProLeuGlyGlyArgAlaGluGlyGlyGlu                               225230235240                                                                   AspGluGluGluValGluSerPheProGlnProValAspAspTyrPhe                               245250255                                                                      ValGluProProGlnAlaGluGluGluGluGluGluGluGluGluArg                               260265270                                                                      AlaProProProSerSerHisThrProValMetValSerArgValThr                               275280285                                                                      ProThrProArgProThrAspGlyValAspValTyrPheGlyMetPro                               290295300                                                                      GlyGluIleGlyGluHisGluGlyPheLeuArgAlaLysMetAspLeu                               305310315320                                                                   GluGluArgArgMetArgGlnIleAsnGluValMetArgGluTrpAla                               325330335                                                                      MetAlaAspSerGlnSerLysAsnLeuProLysAlaAspArgGlnAla                               340345350                                                                      LeuAsnGluHisPheGlnSerIleLeuGlnThrLeuGluGluGlnVal                               355360365                                                                      SerGlyGluArgGlnArgLeuValGluThrHisAlaThrArgValIle                               370375380                                                                      AlaLeuIleAsnAspGlnArgArgAlaAlaLeuGluGlyPheLeuAla                               385390395400                                                                   AlaLeuGlnGlyAspProProGlnAlaGluArgValLeuMetAlaLeu                               405410415                                                                      ArgArgTyrLeuArgAlaGluGlnLysGluGlnArgHisThrLeuArg                               420425430                                                                      HisTyrGlnHisValAlaAlaValAspProGluLysAlaGlnGlnMet                               435440445                                                                      ArgPheGlnValGlnThrHisLeuGlnValIleGluGluArgMetAsn                               450455460                                                                      GlnSerLeuGlyLeuLeuAspGlnAsnProHisLeuAlaGlnGluLeu                               465470475480                                                                   ArgProGlnIleGlnGluLeuLeuLeuAlaGluHisLeuGlyProSer                               485490495                                                                      GluLeuAspAlaSerValProGlySerSerSerGluAspLysGlySer                               500505510                                                                      LeuGlnProProGluSerLysAspAspProProValThrLeuProLys                               515520525                                                                      GlySerThrAspGlnGluSerSerSerSerGlyArgGluLysLeuThr                               530535540                                                                      ProLeuGluGlnTyrGluGlnLysValAsnAlaSerAlaProArgGly                               545550555560                                                                   PheProPheHisSerSerAspIleGlnArgAspGluLeuAlaProSer                               565570575                                                                      GlyThrGlyValSerArgGluAlaLeuSerGlyLeuLeuIleMetGly                               580585590                                                                      AlaGlyGlyGlySerLeuIleValLeuSerLeuLeuLeuLeuArgLys                               595600605                                                                      LysLysProTyrGlyThrIleSerHisGlyValValGluValAspPro                               610615620                                                                      MetLeuThrLeuGluGluGlnGlnLeuArgGluLeuGlnArgHisGly                               625630635640                                                                   TyrGluAsnProThrTyrArgPheLeuGluGluArgPro                                        645650                                                                         (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 706 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        MetAlaAlaThrGlyThrAlaAlaArgAlaAlaThrGlyArgLeuLeu                               151015                                                                         LeuLeuLeuLeuValGlyLeuThrAlaProAlaAlaAlaLeuAlaGly                               202530                                                                         TyrIleGluAlaLeuAlaAlaAlaAlaGlyThrGlyPheAlaValAla                               354045                                                                         GluProGlnIleAlaMetPheCysGlyLysLeuAsnMetHisValAsn                               505560                                                                         IleGlnThrGlyLysTrpGluProAspProThrGlyThrLysSerCys                               65707580                                                                       PheArgThrLysGluGluValLeuGlnTyrCysGlnGluMetTyrPro                               859095                                                                         GluLeuGlnIleThrAsnValMetGluAlaAsnGlnArgValSerIle                               100105110                                                                      AspAsnTrpCysArgArgAspLysLysGlnCysLysSerArgPheVal                               115120125                                                                      ThrProPheLysCysLeuValGlyGluPheValSerAspValLeuLeu                               130135140                                                                      ValProGluLysCysArgPhePheHisLysGluArgMetGluValCys                               145150155160                                                                   GluAsnHisGlnHisTrpHisThrValValLysGluAlaCysLeuThr                               165170175                                                                      GlnGlyMetThrLeuTyrSerTyrGlyMetLeuLeuProCysGlyVal                               180185190                                                                      AspGlnPheHisGlyThrGluTyrValCysCysProGlnThrLysAsp                               195200205                                                                      TyrTrpSerValSerLysGluGluGluGluGluGluAspGluGluGlu                               210215220                                                                      GluGluGluGluAspGluGluGluAspTyrAspValTyrLysSerGlu                               225230235240                                                                   PheProThrGluAlaAspLeuGluAspPheThrGluAlaAlaValAsp                               245250255                                                                      GluAspAspGluAspGluGluGluGlyGluGluValValGluAspArg                               260265270                                                                      AspTyrTyrTyrAspThrPheLysGlyAspAspTyrAsnGluGluAsn                               275280285                                                                      ProThrGluProGlySerAspGlyThrMetSerAspLysGluIleThr                               290295300                                                                      HisAspValLysValProProThrProLeuProThrAsnAspValAsp                               305310315320                                                                   ValTyrPheGluThrSerAlaAspAspAsnGluHisAlaArgPheGln                               325330335                                                                      LysAlaGluLysGluGlnLeuIleGluArgHisArgAsnArgMetAsp                               340345350                                                                      ArgValLysLysGluTrpGluGluAlaGluLeuGlnAlaLysAsnLeu                               355360365                                                                      ProLysAlaGluArgGlnThrLeuIleGlnHisPheGlnAlaMetVal                               370375380                                                                      LysAlaLeuGluLysAlaGluAlaAlaSerGluLysGlnGlnLeuVal                               385390395400                                                                   GluThrHisLeuAlaArgValGluAlaMetLeuAsnAspArgArgMet                               405410415                                                                      AlaLeuGluAsnTyrLeuAlaAlaLeuGlnArgSerAspProProArg                               420425430                                                                      ProHisArgIleLeuGlnProLeuArgArgTyrValArgAlaGluAsn                               435440445                                                                      LysAspArgLeuHisThrIleArgHisTyrGlnHisValLeuAlaVal                               450455460                                                                      AspProGluLysAlaAlaGlnMetLysSerGlnValMetThrHisLeu                               465470475480                                                                   HisValIleGluGluArgArgAsnGlnSerLeuSerLeuLeuTyrLys                               485490495                                                                      AspProTyrValAlaArgIleGlnGluAsnAspGluLeuLeuGlnAla                               500505510                                                                      GluArgAlaAspMetAspGlnPheThrAlaSerIleSerGluThrPro                               515520525                                                                      ValAspValArgValSerSerGluGluSerGluGluIleProProPhe                               530535540                                                                      HisProPheHisProPheProAlaLeuProGluAsnGluAspThrGln                               545550555560                                                                   ProGluLeuTyrHisProMetLysLysGlySerGlyValGlyGluGln                               565570575                                                                      AspGlyGlyLeuIleGlyAlaGluGluLysValIleAsnSerLysAsn                               580585590                                                                      LysValAspGluAsnMetValIleAspGluThrLeuAspLysGluMet                               595600605                                                                      IlePheAsnAlaGluArgValGlyGlyLeuGluGluArgGluSerVal                               610615620                                                                      GlyProLeuArgGluAspPheSerLeuSerSerSerAlaSerIleGly                               625630635640                                                                   LeuLeuValIleAlaValAlaIleAlaThrValIleValIleSerLeu                               645650655                                                                      ValMetLeuArgLysArgGlnValCysThrIleSerHisGlyIleVal                               660665670                                                                      GluValAspProMetLeuThrProGluGluArgHisLeuAsnLysMet                               675680685                                                                      GlnAsnHisGlyTyrGluAsnProThrTyrLysThrLeuGluGlnMet                               690695700                                                                      GlnIle                                                                         705                                                                            (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 694 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        MetLeuProGlyLeuAlaLeuLeuLeuLeuAlaAlaTrpThrAlaArg                               151015                                                                         AlaLeuGluValProThrAspGlyAsnAlaGlyLeuLeuAlaGluPro                               202530                                                                         GlnIleAlaMetPheCysGlyArgLeuAsnMetHisMetAsnValGln                               354045                                                                         AsnGlyLysTrpAspSerAspProSerGlyThrLysThrCysIleAsp                               505560                                                                         ThrLysGluGlyIleLeuGlnTyrCysGlnGluValTyrProGluLeu                               65707580                                                                       GlnIleThrAsnValValGluAlaAsnGlnProValThrIleGlnAsn                               859095                                                                         TrpCysLysArgGlyArgLysGlnCysLysThrHisProHisPheVal                               100105110                                                                      IleProTyrArgCysLeuValGlyGluPheValSerAspAlaLeuLeu                               115120125                                                                      ValProAspLysCysLysPheLeuHisGlnGluArgMetAspValCys                               130135140                                                                      GluThrHisLeuHisTrpHisThrValAlaLysGluThrCysSerGlu                               145150155160                                                                   LysSerThrAsnLeuHisAspTyrGlyMetLeuLeuProCysGlyIle                               165170175                                                                      AspLysPheArgGlyValGluPheValCysCysProLeuAlaGluGlu                               180185190                                                                      SerAspAsnValAspSerAlaAspAlaGluGluAspAspSerAspVal                               195200205                                                                      TrpTrpGlyGlyAlaAspThrAspTyrAlaAspGlySerGluAspLys                               210215220                                                                      ValValGluValAlaGluGluGluGluValAlaGluValGluGluGlu                               225230235240                                                                   GluAlaAspAspAspGluAspAspGluAspGlyAspGluValGluGlu                               245250255                                                                      GluAlaGluGluProTyrGluGluAlaThrGluArgThrThrSerIle                               260265270                                                                      AlaThrThrThrThrThrThrThrGluSerValGluGluValValArg                               275280285                                                                      ValProThrThrAlaAlaSerThrProAspAlaValAspLysTyrLeu                               290295300                                                                      GluThrProGlyAspGluAsnGluHisAlaHisPheGlnLysAlaLys                               305310315320                                                                   GluArgLeuGluAlaLysHisArgGluArgMetSerGlnValMetArg                               325330335                                                                      GluTrpGluGluAlaGluArgGlnAlaLysAsnLeuProLysAlaAsp                               340345350                                                                      LysLysAlaValIleGlnHisPheGlnGluLysValGluSerLeuGlu                               355360365                                                                      GlnGluAlaAlaAsnGluArgGlnGlnLeuValGluThrHisMetAla                               370375380                                                                      ArgValGluAlaMetLeuAsnAspArgArgArgLeuAlaLeuGluAsn                               385390395400                                                                   TyrIleThrAlaLeuGlnAlaValProProArgProArgHisValPhe                               405410415                                                                      AsnMetLeuLysLysTyrValArgAlaGluGlnLysAspArgGlnHis                               420425430                                                                      ThrLeuLysHisPheGluHisValArgMetValAspProLysLysAla                               435440445                                                                      AlaGlnIleArgSerGlnValMetThrHisLeuArgValIleTyrGlu                               450455460                                                                      ArgMetAsnGlnSerLeuSerLeuLeuTyrAsnValProAlaValAla                               465470475480                                                                   GluGluIleGlnAspGluValAspGluLeuLeuGlnLysGluGlnAsn                               485490495                                                                      TyrSerAspAspValLeuAlaAsnMetIleSerGluProArgIleSer                               500505510                                                                      TyrGlyAsnAspAlaLeuMetProSerLeuThrGluThrLysThrThr                               515520525                                                                      ValGluLeuLeuProValAsnGlyGluPheSerLeuAspAspLeuGln                               530535540                                                                      ProTrpHisSerPheGlyAlaAspSerValProAlaAsnThrGluAsn                               545550555560                                                                   GluValGluProValAspAlaArgProAlaAlaAspArgGlyLeuThr                               565570575                                                                      ThrArgProGlySerGlyLeuThrAsnIleLysThrGluGluIleSer                               580585590                                                                      GluValLysMetAspAlaGluPheArgHisAspSerGlyTyrGluVal                               595600605                                                                      HisHisGlnLysLeuValPhePheAlaGluAspValGlySerAsnLys                               610615620                                                                      GlyAlaIleIleGlyLeuMetValGlyGlyValValIleAlaThrVal                               625630635640                                                                   IleValIleThrLeuValMetLeuLysLysLysGlnTyrThrSerIle                               645650655                                                                      HisHisGlyValValGluValAspAlaAlaValThrProGluGluArg                               660665670                                                                      HisLeuSerLysMetGlnGlnAsnGlyTyrGluAsnProThrTyrLys                               675680685                                                                      PhePheGluGlnMetGln                                                             690                                                                            (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 190 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        LeuLeuLeuProLeuSerLeuLeuLeuLeuArgAlaGlnLeuAlaVal                               151015                                                                         GlyAsnLeuAlaValGlySerProSerAlaAlaGluAlaProGlySer                               202530                                                                         AlaGlnValAlaGlyLeuCysGlyArgLeuThrLeuHisArgAspLeu                               354045                                                                         ArgThrGlyArgTrpGluProAspProGlnArgSerArgArgCysLeu                               505560                                                                         LeuAspProGlnArgValLeuGluTyrCysArgGlnMetTyrProGlu                               65707580                                                                       LeuHisIleAlaArgValGluGlnAlaAlaGlnAlaIleProMetGlu                               859095                                                                         ArgTrpCysGlyGlyThrArgSerGlyArgCysAlaHisProHisHis                               100105110                                                                      GluValValProPheHisCysLeuProGlyGluPheValSerGluAla                               115120125                                                                      LeuLeuValProGluGlyCysArgPheLeuHisGlnGluArgMetAsp                               130135140                                                                      GlnCysGluSerSerThrArgArgHisGlnGluAlaGlnGluAlaCys                               145150155160                                                                   SerSerGlnGlyLeuIleLeuHisGlySerGlyMetLeuLeuProCys                               165170175                                                                      GlySerAspArgPheArgGlyValGluTyrValCysCysPro                                     180185190                                                                      (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 155 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        AlaLysMetAspLeuGluGluArgArgMetArgGlnIleAsnGluVal                               151015                                                                         MetArgGluTrpAlaMetAlaAspSerGlnSerLysAsnLeuProLys                               202530                                                                         AlaAspArgGlnAlaLeuAsnGluHisPheGlnSerIleLeuGlnThr                               354045                                                                         LeuGluGluGlnValSerGlyGluArgGlnArgLeuValGluThrHis                               505560                                                                         AlaThrArgValIleAlaLeuIleAsnAspGlnArgArgAlaAlaLeu                               65707580                                                                       GluGlyPheLeuAlaAlaLeuGlnGlyAspProProGlnAlaGluArg                               859095                                                                         ValLeuMetAlaLeuArgArgTyrLeuArgAlaGluGlnLysGluGln                               100105110                                                                      ArgHisThrLeuArgHisTyrGlnHisValAlaAlaValAspProGlu                               115120125                                                                      LysAlaGlnGlnMetArgPheGlnValGlnThrHisLeuGlnValIle                               130135140                                                                      GluGluArgMetAsnGlnSerLeuGlyLeuLeu                                              145150155                                                                      (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 46 amino acids                                                     (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        LysLysLysProTyrGlyThrIleSerHisGlyValValGluValAsp                               151015                                                                         ProMetLeuThrLeuGluGluGlnGlnLeuArgGluLeuGlnArgHis                               202530                                                                         GlyTyrGluAsnProThrTyrArgPheLeuGluGluArgPro                                     354045                                                                         (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 188 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        MetLeuProGlyLeuAlaLeuLeuLeuLeuAlaAlaTrpThrAlaArg                               151015                                                                         AlaLeuGluValProThrAspGlyAsnAlaGlyLeuLeuAlaGluPro                               202530                                                                         GlnIleAlaMetPheCysGlyArgLeuAsnMetHisAsnMetValGln                               354045                                                                         AsnGlyLysTrpAspSerAspProSerGlyThrLysThrCysIleAsp                               505560                                                                         ThrLysGluGlyIleLeuGlnTyrCysGlnGluValTyrProGluLeu                               65707580                                                                       GlnIleThrAsnValValGluAlaAsnGlnProValThrIleGlnAsn                               859095                                                                         TrpCysLysArgGlyArgLysGlnCysLysThrHisProHisPheVal                               100105110                                                                      IleProTyrArgCysLeuValGlyGluPheValSerAspAlaLeuLeu                               115120125                                                                      ValProAspLysCysLysPheLeuHisGlnGluArgMetAspValCys                               130135140                                                                      GluThrHisLeuHisTrpHisThrValAlaLysGluThrCysSerGlu                               145150155160                                                                   LysSerThrAsnLeuHisAspTyrGlyMetLeuLeuProCysGlyIle                               165170175                                                                      AspLysPheArgGlyValGluPheValCysCysPro                                           180185                                                                         (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 155 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       AlaLysGluArgLeuGluAlaLysHisArgGluArgMetSerGlnVal                               151015                                                                         MetArgGluTrpGluGluAlaGluArgGlnAlaLysAsnLeuProLys                               202530                                                                         AlaAspLysLysAlaValIleGlnHisPheGlnGluLysValGluSer                               354045                                                                         LeuGluGlnGluAlaAlaAsnGluArgGlnGlnLeuValGluThrHis                               505560                                                                         MetAlaArgValGluAlaMetLeuAsnAspArgArgArgLeuAlaLeu                               65707580                                                                       GluAsnTyrIleThrAlaLeuGlnAlaValProProArgProArgHis                               859095                                                                         ValPheAsnMetLeuLysLysTyrValArgAlaGluGlnLysAspArg                               100105110                                                                      GlnHisThrLeuLysHisPheGluHisValArgMetValAspProLys                               115120125                                                                      LysAlaAlaGlnIleArgSerGlnValMetThrHisLeuArgValIle                               130135140                                                                      TyrGluArgMetAsnGlnSerLeuSerLeuLeu                                              145150155                                                                      (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 47 amino acids                                                     (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       LysLysLysGlnTyrThrSerIleHisHisGlyValValGluValAsp                               151015                                                                         AlaAlaValThrProGluGluArgHisLeuSerLysMetGlnGlnAsn                               202530                                                                         GlyTyrGluAsnProThrTyrLysPhePheGluGlnMetGlnAsn                                  354045                                                                         (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 197 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       MetCysAlaAlaLeuArgArgAsnLeuLeuLeuArgSerLeuTrpVal                               151015                                                                         ValLeuAlaIleGlyThrAlaGlnValGlnAlaAlaSerSerProArg                               202530                                                                         TrpProGlnIleAlaValLeuCysGluAlaGlyGlnIleTyrGlnPro                               354045                                                                         GlnTyrLeuSerGluGluGlyArgTrpValThrAspLeuSerLysLys                               505560                                                                         ThrThrGlyProThrCysLeuArgAspLysMetAspLeuAspTyrCys                               65707580                                                                       LysLysAlaTyrProAsnArgAspIleThrAsnIleValGluSerSer                               859095                                                                         HisTyrGlnLysIleGlyGlyTrpCysArgGlnGlyAlaLeuAsnAla                               100105110                                                                      AlaLysCysLysGlySerHisArgTrpIleLysProPheArgCysLeu                               115120125                                                                      GlyProPheGlnSerAspAlaLeuLeuTyrProGluGlyCysLeuPhe                               130135140                                                                      AspHisIleHisAsnAlaSerArgCysTrpProPheValArgTrpAsn                               145150155160                                                                   GlnThrGlyAlaAlaAlaCysGlnGluArgGlyMetGlyMetArgThr                               165170175                                                                      PheAlaMetLeuLeuProCysGlyIleSerValPheSerGlyValGlu                               180185190                                                                      PheValCysCysPro                                                                195                                                                            (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 166 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       SerGlnLysArgLeuGluGluSerHisArgGluLysValThrArgVal                               151015                                                                         MetLysAspTrpSerAspLeuGluGluLysTyrGlnAspMetArgLeu                               202530                                                                         AlaAspProLysAlaAlaGlnSerPheLysGlnArgMetThrAlaArg                               354045                                                                         PheGlnThrSerValGlnAlaLeuGluGluGluGlyAsnAlaGluLys                               505560                                                                         HisGlnLeuAlaAlaMetHisGlnGlnArgValLeuAlaHisIleAsn                               65707580                                                                       GlnArgLysArgGluAlaMetThrCysTyrThrGlnAlaLeuThrGlu                               859095                                                                         GlnProProAsnAlaHisHisValGluLysCysLeuGlnLysIleLeu                               100105110                                                                      ArgAlaLeuHisLysAspArgAlaHisAlaLeuAlaHisTyrArgHis                               115120125                                                                      LeuLeuAsnSerGlyGlyProGlyGlyLeuGluAlaAlaAlaSerGlu                               130135140                                                                      ArgProArgThrLeuGluArgLeuIleAspIleAspArgAlaValAsn                               145150155160                                                                   GlnSerMetThrMetLeu                                                             165                                                                            (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 53 amino acids                                                     (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       LysTrpArgThrSerArgSerProHisAlaGlnGlyPheIleGluVal                               151015                                                                         AspGlnAsnValThrThrHisHisProIleValArgGluGluLysIle                               202530                                                                         ValProAsnMetGlnIleAsnGlyTyrGluAsnProThrTyrLysTyr                               354045                                                                         PheGluValLysGlu                                                                50                                                                             (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 47 amino acids                                                     (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       ArgLysArgGlnTyrGlyThrIleSerHisGlyIleValGluValAsp                               151015                                                                         ProMetLeuThrProGluGluArgHisLeuAsnLysMetGlnAsnHis                               202530                                                                         GlyTyrGluAsnProThrTyrLysTyrLeuGluGlnMetGlnIle                                  354045                                                                         (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       GCAACCGAATGGACAGGGTA20                                                         (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       CAAGGCAGCCAGGTAGTTCTC21                                                        (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 23 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       GTAAAGAAGGAATGGGAAGAGGC23                                                      (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 23 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       CCATCCGACGGCGGTCATTCAGC23                                                      __________________________________________________________________________ 

What is claimed is:
 1. A recombinant nuleotide molecule encoding an amyloid precursor-like protein (APLP) polypeptide, comprising a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:4, or a nucleotide sequence complementary thereto.
 2. The recombinant nucleotide molecule of claim 1, wherein said nucleotide molecule comprises cDNA or genomic DNA.
 3. The recombinant nucleotide molecule of claim 1, wherein said nucleotide molecule comprises RNA.
 4. The recombinant nucleotide molecule of claim 1, wherein said nucleotide molecule codes for an APLP polypeptide derived from brain cells of a human.
 5. A recombinant vector, comprising the recombinant nucleotide molecule of claim 1, wherein said recombinant vector expresses said recombinant nucleotide molecule in a host cell to produce the APLP polypeptide.
 6. The nucleotide vector of claim 5, wherein said APLP polypeptide is APLP2 having the amino acid sequence of SEQ ID NO:4.
 7. A host cell transformed with the recombinant nucleotide molecule of claim 1, which host cell expresses said nucleotide molecule to produce the APLP polypeptide.
 8. A method of producing an APLP polypeptide, said method comprising:(a) culturing a host cell comprising the recombinant vector of claim 5 under conditions wherein said APLP polypeptide encoded by said recombinant vector is expressed in said host cell; and (b) isolating the APLP polypeptide produced in step (a), wherein said APLP polypeptide is APLP2 having the amino acid sequence of SEQ ID NO:4.
 9. The method of claim 8, wherein said host cell is a mammalian cell. 